Skip to main content

Showing 101–150 of 1,336 results for author: Huang, M

.
  1. arXiv:2312.08880  [pdf, other

    cs.CV

    GenDet: Towards Good Generalizations for AI-Generated Image Detection

    Authors: Mingjian Zhu, Hanting Chen, Mouxiao Huang, Wei Li, Hailin Hu, Jie Hu, Yunhe Wang

    Abstract: The misuse of AI imagery can have harmful societal effects, prompting the creation of detectors to combat issues like the spread of fake news. Existing methods can effectively detect images generated by seen generators, but it is challenging to detect those generated by unseen generators. They do not concentrate on amplifying the output discrepancy when detectors process real versus fake images. T… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  2. arXiv:2312.07937  [pdf, other

    cs.CV

    BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics

    Authors: Wenqian Zhang, Molin Huang, Yuxuan Zhou, Juze Zhang, **gyi Yu, **gya Wang, Lan Xu

    Abstract: The recently emerging text-to-motion advances have spired numerous attempts for convenient and interactive human motion generation. Yet, existing methods are largely limited to generating body motions only without considering the rich two-hand motions, let alone handling various conditions like body dynamics or texts. To break the data bottleneck, we propose BOTH57M, a novel multi-modal dataset fo… ▽ More

    Submitted 10 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted to CVPR 2024

  3. The FAST all sky HI survey (FASHI): The first release of catalog

    Authors: Chuan-Peng Zhang, M. Zhu, P. Jiang, C. Cheng, J. Wang, J. Wang, J. -L. Xu, X. -L. Liu, N. -P. Yu, L. Qian, H. Yu, M. Ai, Y. **g, C. Xu, Z. Liu, X. Guan, C. Sun, Q. Yang, M. Huang, Q. Hao, FAST Collaboration

    Abstract: The FAST All Sky HI survey (FASHI) was designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), spanning approximately 22000 square degrees of declination between -14 deg and +66 deg, and in the frequency range of 1050-1450 MHz, with the expectation of eventually detecting more than 100000 HI sources. Between August 2020 and June 2023, FASHI… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 22 pages, 12 figures, published in SCPMA. All catalogs are available at https://zcp521.github.io/fashi and https://fast.bao.ac.cn/cms/article/271/

    Journal ref: Sci. China-Phys. Mech. Astron. 67, 219511 (2024)

  4. arXiv:2312.02720  [pdf, other

    cs.LG cs.AI

    Towards the Inferrence of Structural Similarity of Combinatorial Landscapes

    Authors: Mingyu Huang, Ke Li

    Abstract: One of the most common problem-solving heuristics is by analogy. For a given problem, a solver can be viewed as a strategic walk on its fitness landscape. Thus if a solver works for one problem instance, we expect it will also be effective for other instances whose fitness landscapes essentially share structural similarities with each other. However, due to the black-box nature of combinatorial op… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2312.02161  [pdf, other

    cs.IT cs.NE

    Efficient LDPC Decoding using Physical Computation

    Authors: Uday Kumar Reddy Vengalam, Andrew Hahn, Yongchao Liu, Anshujit Sharma, Hui Wu, Michael Huang

    Abstract: Due to 5G deployment, there is significant interest in LDPC decoding. While much research is devoted on efficient hardwiring of algorithms based on Belief Propagation (BP), it has been shown that LDPC decoding can be formulated as a combinatorial optimization problem, which could benefit from significant acceleration of physical computation mechanisms such as Ising machines. This approach has so f… ▽ More

    Submitted 20 September, 2023; originally announced December 2023.

  6. arXiv:2312.01814  [pdf

    physics.optics

    Non-saturation intensity dependence of anisotropic third-order optical nonlinearity approaching the damage threshold in ZnSe and GaP

    Authors: Jianpeng Ye, Min Huang

    Abstract: The intensity dependence of anisotropic third-order optical nonlinearity approaching the damage threshold in ZnSe and GaP crystals is studied by the femtosecond laser pump-probe measurements, which can greatly reduce the laser-matter interaction length and thus realize the probing of orientation-dependent characteristics of nonlinear optical phenomena in the near-damage-threshold intensity regime… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 15 pages, 13 figures

  7. arXiv:2312.01733  [pdf, other

    cond-mat.mtrl-sci

    Metastability and anharmonicity enhance defect-assisted nonradiative recombination in low-symmetry semiconductors

    Authors: Menglin Huang, Shanshan Wang, Shiyou Chen

    Abstract: Strong nonradiative recombination has been observed in quasi-one-dimensional antimony selenide, which runs counter to the simple intuition that claims high defect tolerance exists in semiconductors with antibonding state in the valence band and bonding state in the conduction band. Here we reveal such a defect intolerance actually stems from the richness of structural metastability and vibrational… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  8. arXiv:2311.18743  [pdf, other

    cs.CL cs.AI cs.LG

    AlignBench: Benchmarking Chinese Alignment of Large Language Models

    Authors: Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Hongning Wang, **g Zhang, Minlie Huang, Yuxiao Dong, Jie Tang

    Abstract: Alignment has become a critical step for instruction-tuned Large Language Models (LLMs) to become helpful assistants. However, effective evaluation of alignment for emerging Chinese LLMs is still significantly lacking, calling for real-scenario grounded, open-ended, challenging and automatic evaluations tailored for alignment. To fill in this gap, we introduce AlignBench, a comprehensive multi-dim… ▽ More

    Submitted 5 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  9. arXiv:2311.18702  [pdf, other

    cs.CL cs.AI

    CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation

    Authors: Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

    Abstract: Since the natural language processing (NLP) community started to make large language models (LLMs) act as a critic to evaluate the quality of generated texts, most of the existing works train a critique generation model on the evaluation data labeled by GPT-4's direct prompting. We observe that these models lack the ability to generate informative critiques in both pointwise grading and pairwise c… ▽ More

    Submitted 26 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted by ACL 2024 (Main Conference)

  10. arXiv:2311.17391  [pdf, other

    cs.CL

    Unveiling the Implicit Toxicity in Large Language Models

    Authors: Jiaxin Wen, Pei Ke, Hao Sun, Zhexin Zhang, Chengfei Li, **feng Bai, Minlie Huang

    Abstract: The open-endedness of large language models (LLMs) combined with their impressive capabilities may lead to new safety issues when being exploited for malicious use. While recent studies primarily focus on probing toxic outputs that can be easily detected with existing toxicity classifiers, we show that LLMs can generate diverse implicit toxic outputs that are exceptionally difficult to detect via… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Main Conference

  11. arXiv:2311.16832  [pdf, other

    cs.CL cs.AI

    CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

    Authors: **feng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wen**g Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

    Abstract: In this paper, we present CharacterGLM, a series of models built upon ChatGLM, with model sizes ranging from 6B to 66B parameters. Our CharacterGLM is designed for generating Character-based Dialogues (CharacterDial), which aims to equip a conversational AI system with character customization for satisfying people's inherent social desires and emotional needs. On top of CharacterGLM, we can custom… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Work in progress

  12. arXiv:2311.14283  [pdf

    physics.geo-ph

    Strong Interference HVSR Data Processing and Denoising: HVSR Curve Reconstruction Method based on UPEMD

    Authors: Bingxuan Song, Fuxing Han, Yubei Chen, Linjun Wu, Mengting Huang, Yanjie Pan

    Abstract: Urban areas pose a challenge for the application of the H/V method due to a high degree of artificial noise. The existing methods fall short in reducing the noise of strong interference data. To solve this issue, a new approach called the HVSR curve reconstruction method is introduced in this paper. The method employs the UPEMD technique to analyze the data component, and the extracted signal is e… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  13. arXiv:2311.14014  [pdf, other

    cs.LG

    On the Hyperparameter Loss Landscapes of Machine Learning Models: An Exploratory Study

    Authors: Mingyu Huang, Ke Li

    Abstract: Previous efforts on hyperparameter optimization (HPO) of machine learning (ML) models predominately focus on algorithmic advances, yet little is known about the topography of the underlying hyperparameter (HP) loss landscape, which plays a fundamental role in governing the search process of HPO. While several works have conducted fitness landscape analysis (FLA) on various ML systems, they are lim… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 31 pages, 15 figures, 12 tables

  14. arXiv:2311.12256  [pdf

    cond-mat.mes-hall

    Local control of a single nitrogen-vacancy center by nanoscale engineered magnetic domain wall motions

    Authors: Nathan J. McLaughlin, Senlei Li, Jeffrey A. Brock, Shu Zhang, Hanyi Lu, Mengqi Huang, Yuxuan Xiao, **gcheng Zhou, Yaroslav Tserkovnyak, Eric E. Fullerton, Hailong Wang, Chunhui Rita Du

    Abstract: Effective control and readout of qubits form the technical foundation of next-generation, transformative quantum information sciences and technologies. The nitrogen-vacancy (NV) center, an intrinsic three-level spin system, is naturally relevant in this context due to its excellent quantum coherence, high fidelity of operations, and remarkable functionality over a broad range of experimental condi… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 13 pages, 5 figures

  15. arXiv:2311.11540  [pdf

    cond-mat.supr-con

    Prominent Josephson tunneling between twisted single copper oxide planes of Bi$_2$Sr$_{2-x}$LaxCuO$_{6+y}$

    Authors: Heng Wang, Yuying Zhu, Zhonghua Bai, Zechao Wang, Shuxu Hu, Hong-Yi Xie, Xiaopeng Hu, Jian Cui, Miaoling Huang, Jianhao Chen, Ying Ding, Lin Zhao, Xinyan Li, Qinghua Zhang, Lin Gu, X. J. Zhou, **g Zhu, Ding Zhang, Qi-Kun Xue

    Abstract: Josephson tunneling in twisted cuprate junctions provides a litmus test for the pairing symmetry, which is fundamental for understanding the microscopic mechanism of high temperature superconductivity. This issue is rekindled by experimental advances in van der Waals stacking and the proposal of an emergent d+id-wave. So far, all experiments have been carried out on Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$ (… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 32 pages, 5 figures

    Journal ref: Nature Communications 14, 5201 (2023)

  16. arXiv:2311.09532  [pdf, other

    cs.CR

    LightEMU: Hardware Assisted Fuzzing of Trusted Applications

    Authors: Haoqi Shan, Sravani Nissankararao, Yujia Liu, Moyao Huang, Shuo Wang, Yier **, Dean Sullivan

    Abstract: Trusted Execution Environments (TEEs) are deployed in many CPU designs because of the confidentiality and integrity guarantees they provide. ARM TrustZone is a TEE extensively deployed on smart phones, IoT devices, and notebooks. Specifically, TrustZone is used to separate code execution and data into two worlds, normal world and secure world. However, this separation inherently prevents tradition… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted by IEEE International Symposium on Hardware Oriented Security and Trust (HOST'2024)

  17. arXiv:2311.09096  [pdf, other

    cs.CL

    Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

    Authors: Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang

    Abstract: While significant attention has been dedicated to exploiting weaknesses in LLMs through jailbreaking attacks, there remains a paucity of effort in defending against these attacks. We point out a pivotal factor contributing to the success of jailbreaks: the intrinsic conflict between the goals of being helpful and ensuring safety. Accordingly, we propose to integrate goal prioritization at both tra… ▽ More

    Submitted 12 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: ACL 2024 Main Conference

  18. arXiv:2311.08896  [pdf, other

    cs.CL

    HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation

    Authors: Junyi Bian, Xiaolei Qin, Wuhe Zou, Mengzuo Huang, Congyi Luo, Ke Zhang, Weidong Zhang

    Abstract: Large models have demonstrated significant progress across various domains, particularly in tasks related to text generation. In the domain of Table to Text, many Large Language Model (LLM)-based methods currently resort to modifying prompts to invoke public APIs, incurring potential costs and information leaks. With the advent of open-source large models, fine-tuning LLMs has become feasible. In… ▽ More

    Submitted 27 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  19. arXiv:2311.08714  [pdf, other

    hep-th

    Schur indices for $\mathcal{N}=4$ super-Yang-Mills with more general gauge groups

    Authors: Bao-ning Du, Min-xin Huang, Xin Wang

    Abstract: We study the unflavored Schur indices in the $\mathcal{N}=4$ super-Yang-Mills theory for the $B_n,C_n,D_n, G_2$ gauge groups. We explore two methods, namely the character expansion method and the Fermi gas method, to efficiently compute the $q$-series expansion of the Schur indices to some high orders. Using the available data and the modular properties, we are able to fix the exact formulas for t… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 30 pages

    Report number: USTC-ICTS/PCFT-23-33, KIAS-Q23022

  20. arXiv:2311.04155  [pdf, other

    cs.CL

    Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

    Authors: Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang

    Abstract: Large language models (LLMs) have shown impressive success in various applications. However, these models are often not well aligned with human intents, which calls for additional treatments on them; that is, the alignment problem. To make LLMs better follow user instructions, existing alignment methods primarily focus on further training them. However, the extra training of LLMs is usually expens… ▽ More

    Submitted 21 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted to ACL 2024

  21. arXiv:2311.03493  [pdf

    cond-mat.mtrl-sci

    Dimensionality crossover to 2D vestigial nematicity from 3D zigzag antiferromagnetism in an XY-type honeycomb van der Waals magnet

    Authors: Zeliang Sun, Gaihua Ye, Mengqi Huang, Chengkang Zhou, Nan Huang, Qiuyang Li, Zhipeng Ye, Cynthia Nnokwe, Hui Deng, David Mandrus, Zi Yang Meng, Kai Sun, Chunhui Du, Rui He, Liuyan Zhao

    Abstract: Fluctuations and disorder effects are substantially enhanced in reduced dimensionalities. While they are mostly considered as the foe for long-range orders, fluctuations and disorders can also stimulate the emergence of novel phases of matter, for example, vestigial orders. Taking 2D magnetism as a platform, existing efforts have been focused on maintaining 2D long-range magnetic orders by suppres… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  22. arXiv:2311.02804  [pdf, ps, other

    cs.CC math.NT

    Last fall degree of semi-local polynomial systems

    Authors: Ming-Deh A. Huang

    Abstract: We study the last fall degrees of {\em semi-local} polynomial systems, and the computational complexity of solving such systems for closed-point and rational-point solutions, where the systems are defined over a finite field. A semi-local polynomial system specifies an algebraic set which is the image of a global linear transformation of a direct product of local affine algebraic sets. As a specia… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  23. arXiv:2311.02400  [pdf, other

    cs.CY

    From Plate to Production: Artificial Intelligence in Modern Consumer-Driven Food Systems

    Authors: Weiqing Min, Pengfei Zhou, Leyi Xu, Tao Liu, Tianhao Li, Mingyu Huang, Ying **, Yifan Yi, Min Wen, Shuqiang Jiang, Ramesh Jain

    Abstract: Global food systems confront the urgent challenge of supplying sustainable, nutritious diets in the face of escalating demands. The advent of Artificial Intelligence (AI) is bringing in a personal choice revolution, wherein AI-driven individual decisions transform food systems from dinner tables, to the farms, and back to our plates. In this context, AI algorithms refine personal dietary choices,… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  24. arXiv:2311.01357  [pdf, other

    cs.CV

    Robust Identity Perceptual Watermark Against Deepfake Face Swap**

    Authors: Tianyi Wang, Mengxiao Huang, Harry Cheng, Bin Ma, Yinglong Wang

    Abstract: Notwithstanding offering convenience and entertainment to society, Deepfake face swap** has caused critical privacy issues with the rapid development of deep generative models. Due to imperceptible artifacts in high-quality synthetic images, passive detection models against face swap** in recent years usually suffer performance dam** regarding the generalizability issue. Therefore, several s… ▽ More

    Submitted 15 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: In peer review

  25. arXiv:2311.00944  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    Stochastic Smoothed Gradient Descent Ascent for Federated Minimax Optimization

    Authors: Wei Shen, Minhui Huang, Jiawei Zhang, Cong Shen

    Abstract: In recent years, federated minimax optimization has attracted growing interest due to its extensive applications in various machine learning tasks. While Smoothed Alternative Gradient Descent Ascent (Smoothed-AGDA) has proved its success in centralized nonconvex minimax optimization, how and whether smoothing technique could be helpful in federated setting remains unexplored. In this paper, we pro… ▽ More

    Submitted 18 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  26. arXiv:2311.00397  [pdf, other

    cs.CV

    Towards Omni-supervised Referring Expression Segmentation

    Authors: Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun

    Abstract: Referring Expression Segmentation (RES) is an emerging task in computer vision, which segments the target instances in images based on text descriptions. However, its development is plagued by the expensive segmentation labels. To address this issue, we propose a new learning task for RES called Omni-supervised Referring Expression Segmentation (Omni-RES), which aims to make full use of unlabeled,… ▽ More

    Submitted 27 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  27. arXiv:2311.00367  [pdf, other

    cs.CL cs.AI

    Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition

    Authors: Chenxu Wang, ** Jian, Mu Huang

    Abstract: Implicit Discourse Relation Recognition (IDRR), which infers discourse relations without the help of explicit connectives, is still a crucial and challenging task for discourse parsing. Recent works tend to exploit the hierarchical structure information from the annotated senses, which demonstrate enhanced discourse relation representations can be obtained by integrating sense hierarchy. Neverthel… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: This paper is accepted by the EMNLP 2023 Main Conference

  28. arXiv:2310.20194  [pdf, other

    hep-ph nucl-th

    Evolution of topological charge through chiral anomaly transport

    Authors: Zilin Yuan, An** Huang, Wen-Hao Zhou, Guo-Liang Ma, Mei Huang

    Abstract: Built upon the state-of-the-art model a multiphase transport (AMPT), we develop a new module of chiral anomaly transport (CAT), which can trace the evolution of the initial topological charge of gauge field created through sphaleron transition at finite temperature and external magnetic field in heavy ion collisions. The eventual experimental signals of chiral magnetic effect(CME) can be measured.… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures

  29. arXiv:2310.15452  [pdf, ps, other

    math.FA

    Riesz type theorems for $κ$-pluriharmonic map**s, invariant harmonic quasiregular map**s and harmonic quasiregular map**s

    Authors: Shaolin Chen, Manzi Huang

    Abstract: The main purpose of this paper is to develop some methods to improve and generalize the main results in a recent paper by Liu and Zhu (Adv. Math., 2023, i.e., \cite{L-Z}). The paper consists of two parts. In the first part, we discuss the Riesz type theorem in the setting of $n$-dimensional complex spaces for all $n\geq 1$. In this part, we first introduce the family of $κ$-pluriharmonic map**s… ▽ More

    Submitted 29 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 29 pages

    MSC Class: 30H10; 30C62

  30. arXiv:2310.14564  [pdf, other

    cs.CL

    Language Models Hallucinate, but May Excel at Fact Verification

    Authors: Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng

    Abstract: Recent progress in natural language processing (NLP) owes much to remarkable advances in large language models (LLMs). Nevertheless, LLMs frequently "hallucinate," resulting in non-factual outputs. Our carefully-designed human evaluation substantiates the serious hallucination issue, revealing that even GPT-3.5 produces factual outputs less than 25% of the time. This underscores the importance of… ▽ More

    Submitted 20 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted in NAACL 2024

  31. arXiv:2310.14498  [pdf

    physics.ed-ph cs.CY

    Reforming Physics Exams Using Openly Accessible Large Isomorphic Problem Banks created with the assistance of Generative AI: an Explorative Study

    Authors: Zhongzhou Chen, Emily Frederick, Colleen Cui, Munaimah Khan, Christopher Klatt, Mercedith Huang, Shiyang Su

    Abstract: This paper explores using large isomorphic problem banks to overcome many challenges of traditional exams in large STEM classes, especially the threat of content sharing websites and generative AI to the security of exam items. We first introduce an efficient procedure for creating large numbers of isomorphic physics problems, assisted by the large language model GPT-3 and several other open-sourc… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  32. arXiv:2310.08149  [pdf, ps, other

    nucl-th

    Semi-relativistic antisymmetrized molecular dynamics for energetic neutron production in intermediate energy heavy-ion reactions

    Authors: Q. Hu, G. Y. Tian, R. Wada, X. Q. Liu, W. P. Lin, H. Zheng, Y. P. Zhang, Z. Q. Chen, R. Han, M. R. Huang

    Abstract: Relativistic corrections have been made in the non-relativistic antisymmetrized molecular dynamics (AMD) simulations to apply to the high energy neutron production in the $^{12}$C+$^{12}$C and $^{16}$O+$^{12}$C collisions at incident energies of 290 and 400 MeV/nucleon. The corrections are made in kinematics alone and no nucleon-nucleon inelastic scatterings nor meson productions are taken into ac… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  33. arXiv:2310.07234  [pdf, other

    cs.LG

    Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality

    Authors: Liyuan Wang, **gyi Xie, Xingxing Zhang, Mingyi Huang, Hang Su, Jun Zhu

    Abstract: Prompt-based continual learning is an emerging direction in leveraging pre-trained knowledge for downstream continual learning, and has almost reached the performance pinnacle under supervised pre-training. However, our empirical research reveals that the current strategies fall short of their full potential under the more realistic self-supervised pre-training, which is essential for handling vas… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 23 pages, 20 figures, 11 tables, accepted by NeurIPS as a Spotlight

  34. arXiv:2310.06484  [pdf, other

    cs.AI

    Memory efficient location recommendation through proximity-aware representation

    Authors: Xuan Luo, Mingqing Huang, Rui Lv, Hui Zhao

    Abstract: Sequential location recommendation plays a huge role in modern life, which can enhance user experience, bring more profit to businesses and assist in government administration. Although methods for location recommendation have evolved significantly thanks to the development of recommendation systems, there is still limited utilization of geographic information, along with the ongoing challenge of… ▽ More

    Submitted 24 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  35. arXiv:2310.05393  [pdf, other

    cs.CV

    Hierarchical Side-Tuning for Vision Transformers

    Authors: Weifeng Lin, Ziheng Wu, Wentao Yang, Mingxin Huang, Jun Huang, Lianwen **

    Abstract: Fine-tuning pre-trained Vision Transformers (ViTs) has showcased significant promise in enhancing visual recognition tasks. Yet, the demand for individualized and comprehensive fine-tuning processes for each task entails substantial computational and memory costs, posing a considerable challenge. Recent advancements in Parameter-Efficient Transfer Learning (PETL) have shown potential for achieving… ▽ More

    Submitted 15 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures

  36. arXiv:2310.05317  [pdf, other

    cs.CL cs.AI

    Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond

    Authors: Siyang Liu, Naihao Deng, Sahand Sabour, Yilin Jia, Minlie Huang, Rada Mihalcea

    Abstract: We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive tokenizer samples variable segmentations from multiple outcomes, with sampling probabilities optimized based on task-specific data. We introduce a strategy for building… ▽ More

    Submitted 13 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted at the main conference of The 2023 Conference on Empirical Methods in Natural Language Processing; 8 pages

    MSC Class: 68 ACM Class: I.2.7

    Journal ref: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  37. arXiv:2310.03368  [pdf, other

    cs.CL

    Evaluating Hallucinations in Chinese Large Language Models

    Authors: Qinyuan Cheng, Tianxiang Sun, Wenwei Zhang, Siyin Wang, Xiangyang Liu, Mozhi Zhang, Junliang He, Mianqiu Huang, Zhangyue Yin, Kai Chen, Xipeng Qiu

    Abstract: In this paper, we establish a benchmark named HalluQA (Chinese Hallucination Question-Answering) to measure the hallucination phenomenon in Chinese large language models. HalluQA contains 450 meticulously designed adversarial questions, spanning multiple domains, and takes into account Chinese historical culture, customs, and social phenomena. During the construction of HalluQA, we consider two ty… ▽ More

    Submitted 25 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Work in progress

  38. arXiv:2310.01041  [pdf, other

    cs.CL

    Language Model Decoding as Direct Metrics Optimization

    Authors: Haozhe Ji, Pei Ke, Hongning Wang, Minlie Huang

    Abstract: Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive texts which are often disjunctive in discourse, while search-based methods maintain topic coherence at the cost of increased repetition. Overall, these methods f… ▽ More

    Submitted 5 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 33 pages, 3 figures

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR 2024)

  39. arXiv:2309.17452  [pdf, other

    cs.CL cs.AI

    ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

    Authors: Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen

    Abstract: Large language models have made significant progress in various language tasks, yet they still struggle with complex mathematics. In this paper, we propose ToRA a series of Tool-integrated Reasoning Agents designed to solve challenging mathematical problems by seamlessly integrating natural language reasoning with the utilization of external tools (e.g., computation libraries and symbolic solvers)… ▽ More

    Submitted 21 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024; First two authors equal contribution

  40. arXiv:2309.16103  [pdf, other

    cond-mat.soft cond-mat.stat-mech physics.flu-dyn

    Non-equilibrium molecular dynamics of steady-state fluid transport through a 2D membrane driven by a concentration gradient

    Authors: Daniel J. Rankin, David M. Huang

    Abstract: We use a novel non-equilibrium algorithm to simulate steady-state fluid transport through a two-dimensional (2D) membrane due to a concentration gradient by molecular dynamics (MD) for the first time. We confirm that, as required by the Onsager reciprocal relations in the linear-response regime, the solution flux obtained using this algorithm agrees with the excess solute flux obtained from an est… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Journal ref: J. Chem. Phys. 159, 214705 (2023)

  41. arXiv:2309.14627  [pdf, other

    quant-ph physics.chem-ph

    A First Principles Derivation of Energy Conserving Momentum Jumps in Surface Hop** Simulations

    Authors: Dorothy Miaoyu Huang, Austin T. Green, Craig C. Martens

    Abstract: The fewest switches surface hop** (FSSH) method proposed by Tully in 1990 [J. C Tully, J. Chem. Phys. 93, 1061 (1990)] -- along with its many later variations -- is basis for most practical simulations of molecular dynamics with electronic transitions in realistic systems. Despite its popularity, a rigorous formal derivation of the algorithm has yet to be achieved. In this paper, we derive the e… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures

  42. arXiv:2309.09765  [pdf

    cs.CV

    Localization-Guided Track: A Deep Association Multi-Object Tracking Framework Based on Localization Confidence of Detections

    Authors: Ting Meng, Chunyun Fu, Mingguang Huang, Xiyang Wang, Jiawei He, Tao Huang, Wankai Shi

    Abstract: In currently available literature, no tracking-by-detection (TBD) paradigm-based tracking method has considered the localization confidence of detection boxes. In most TBD-based methods, it is considered that objects of low detection confidence are highly occluded and thus it is a normal practice to directly disregard such objects or to reduce their priority in matching. In addition, appearance si… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures

  43. arXiv:2309.09521  [pdf

    physics.app-ph

    Large Nonreciprocity of Shear-Horizontal Surface Acoustic Waves induced by Magnetoelastic Bilayers

    Authors: Mingxian Huang, Yuanyuan Liu, Wenbin Hu, Yutong Wu, Wen Wang, Wei He, Huaiwu Zhang, Feiming Bai

    Abstract: We report large nonreciprocity in the transmission of shear-horizontal surface acoustic waves (SAWs) on LiTaO3 substrate coated with a FeCoSiB/NiFeCu magnetoelastic bilayer. The large difference in saturation magnetization of the two layers not only brings nonreciprocal spin waves (SWs), but also ensures the phonon-magnon (SAWs-SWs) coupling at relatively low wavenumbers. It is found that the angl… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  44. arXiv:2309.09485  [pdf

    cs.CV

    Distributional Estimation of Data Uncertainty for Surveillance Face Anti-spoofing

    Authors: Mouxiao Huang

    Abstract: Face recognition systems have become increasingly vulnerable to security threats in recent years, prompting the use of Face Anti-spoofing (FAS) to protect against various types of attacks, such as phone unlocking, face payment, and self-service security inspection. While FAS has demonstrated its effectiveness in traditional settings, securing it in long-distance surveillance scenarios presents a s… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  45. arXiv:2309.07045  [pdf, other

    cs.CL

    SafetyBench: Evaluating the Safety of Large Language Models

    Authors: Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang

    Abstract: With the rapid development of Large Language Models (LLMs), increasing attention has been paid to their safety concerns. Consequently, evaluating the safety of LLMs has become an essential task for facilitating the broad applications of LLMs. Nevertheless, the absence of comprehensive safety evaluation benchmarks poses a significant impediment to effectively assess and enhance the safety of LLMs.… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: ACL 2024 Main Conference

  46. Interplay between the muon $g-2$ anomaly and the PTA nHZ gravitational waves from domain walls in next-to minimal supersymmetric standard model

    Authors: Ming Xia Huang, Fei Wang, Ying Kai Zhang

    Abstract: With some explicitly $Z_3$ breaking terms in the NMSSM effective superpotential and scalar potential, domain walls (DWs) from spontaneously breaking of the discrete symmetry in approximate $Z_3$-invariant NMSSM can collapse and lead to observable stochastic gravitational wave (GW) background signals. In the presence of a hidden sector, such terms may originate from the geometric superconformal bre… ▽ More

    Submitted 17 April, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 33 pages, 6 figures, typos corrected, version published in PRD

    Journal ref: Phys. Rev. D 109, 075032 (2014)

  47. arXiv:2309.06156  [pdf, other

    hep-ph

    $D_{(s)}-$ mesons semileptonic form factors in the 4-flavor holographic QCD

    Authors: Hiwa A. Ahmed, Yidian Chen, Mei Huang

    Abstract: We investigate semileptonic form factors of $D_{(s)}$ meson from a modified soft-wall 4-flavor holographic model. The model successfully reproduces the masses and decay constants of various mesons, including $ρ$, $K^*$, $D^*$, $D_s^*$, $a_1$, $K_1$, $f_1$, $D_1$,$D_{s1}$, $π$, $K$, $η$, $D$, and $D_s$. Moreover, we study the semileptonic decay processes $D^{+} \to (π, K, η) l^{+} ν_{l}$ and… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2308.14975

  48. arXiv:2309.04819  [pdf, other

    quant-ph cs.CR cs.LG

    Detecting Violations of Differential Privacy for Quantum Algorithms

    Authors: Ji Guan, Wang Fang, Mingyu Huang, Mingsheng Ying

    Abstract: Quantum algorithms for solving a wide range of practical problems have been proposed in the last ten years, such as data search and analysis, product recommendation, and credit scoring. The concern about privacy and other ethical issues in quantum computing naturally rises up. In this paper, we define a formal framework for detecting violations of differential privacy for quantum algorithms. A det… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Journal ref: In Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security (CCS 2023)

  49. arXiv:2309.04359  [pdf, other

    cond-mat.quant-gas cond-mat.supr-con physics.atom-ph

    Irreversible entropy transport enhanced by fermionic superfluidity

    Authors: Philipp Fabritius, Jeffrey Mohan, Mohsen Talebi, Simon Wili, Wilhelm Zwerger, Meng-Zi Huang, Tilman Esslinger

    Abstract: The nature of particle and entropy flow between two superfluids is often understood in terms of reversible flow carried by an entropy-free, macroscopic wavefunction. While this wavefunction is responsible for many intriguing properties of superfluids and superconductors, its interplay with excitations in non-equilibrium situations is less understood. Here, we observe large concurrent flows of both… ▽ More

    Submitted 22 April, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: This version of the article has been accepted for publication, after peer review but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1038/s41567-024-02483-3

    Journal ref: Nature Physics (2024)

  50. arXiv:2309.03882  [pdf, other

    cs.CL

    Large Language Models Are Not Robust Multiple Choice Selectors

    Authors: Chujie Zheng, Hao Zhou, Fandong Meng, Jie Zhou, Minlie Huang

    Abstract: Multiple choice questions (MCQs) serve as a common yet important task format in the evaluation of large language models (LLMs). This work shows that modern LLMs are vulnerable to option position changes in MCQs due to their inherent "selection bias", namely, they prefer to select specific option IDs as answers (like "Option A"). Through extensive empirical analyses with 20 LLMs on three benchmarks… ▽ More

    Submitted 21 February, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 Spotlight