Skip to main content

Showing 51–100 of 1,138 results for author: Zhang, N

.
  1. arXiv:2403.04918  [pdf, other

    cs.CR

    Secure Information Embedding and Extraction in Forensic 3D Fingerprinting

    Authors: Canran Wang, **wen Wang, Mi Zhou, Vinh Pham, Senyue Hao, Chao Zhou, Ning Zhang, Netanel Raviv

    Abstract: The prevalence of 3D printing poses a significant risk to public safety, as any individual with internet access and a commodity printer is able to produce untraceable firearms, keys, counterfeit products, etc. To aid government authorities in combating these new security threats, several approaches have been taken to tag 3D-prints with identifying information. Known as fingerprints, this informati… ▽ More

    Submitted 12 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2403.04459  [pdf, ps, other

    physics.comp-ph math.NA physics.optics

    An efficient method for calculating resonant modes in biperiodic photonic structures

    Authors: Nan Zhang, Ya Yan Lu

    Abstract: Many photonic devices, such as photonic crystal slabs, cross gratings, and periodic metasurfaces, are biperiodic structures with two independent periodic directions, and are sandwiched between two homogeneous media. Many applications of these devices are closely related to resonance phenomena. Therefore, efficient computation of resonant modes is crucial in device design and structure analysis. Si… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2403.04267  [pdf, other

    astro-ph.EP astro-ph.IM

    Parallel numerical simulation of impact crater with perfect matched layers

    Authors: Huacheng Li, Zongyu Yue, Nan Zhang, **hai Zhang, Zhongzheng Miao

    Abstract: Impact craters are the primary geomorphic features on the surfaces of celestial bodies such as the Moon, and their formation has significant implications for the evolutionary history of the celestial body. The study of the impact crater formation process relies mainly on numerical simulation methods, with two-dimensional simulations capable of reproducing general patterns of impact processes while… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 17 pages, 8 figures

  4. arXiv:2403.03101  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.MA

    KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

    Authors: Yuqi Zhu, Shuofei Qiao, Yixin Ou, Shumin Deng, Ningyu Zhang, Shiwei Lyu, Yue Shen, Lei Liang, **jie Gu, Huajun Chen

    Abstract: Large Language Models (LLMs) have demonstrated great potential in complex reasoning tasks, yet they fall short when tackling more sophisticated challenges, especially when interacting with environments through generating executable actions. This inadequacy primarily stems from the lack of built-in action knowledge in language agents, which fails to effectively guide the planning trajectories durin… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Work in progress. Project page: https://zjunlp.github.io/project/KnowAgent/ Code: https://github.com/zjunlp/KnowAgent

  5. arXiv:2403.02075  [pdf, other

    cs.CV

    DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

    Authors: Weiyi Lv, Yuhang Huang, Ning Zhang, Ruei-Sung Lin, Mei Han, Dan Zeng

    Abstract: In Multiple Object Tracking, objects often exhibit non-linear motion of acceleration and deceleration, with irregular direction changes. Tacking-by-detection (TBD) trackers with Kalman Filter motion prediction work well in pedestrian-dominant scenarios but fall short in complex situations when multiple objects perform non-linear and diverse motion simultaneously. To tackle the complex non-linear m… ▽ More

    Submitted 20 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  6. arXiv:2402.18649  [pdf, other

    cs.CR cs.AI

    A New Era in LLM Security: Exploring Security Concerns in Real-World LLM-based Systems

    Authors: Fangzhou Wu, Ning Zhang, Somesh Jha, Patrick McDaniel, Chaowei Xiao

    Abstract: Large Language Model (LLM) systems are inherently compositional, with individual LLM serving as the core foundation with additional layers of objects such as plugins, sandbox, and so on. Along with the great potential, there are also increasing concerns over the security of such probabilistic intelligent systems. However, existing studies on LLM security often focus on individual LLM, but without… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2402.18215  [pdf

    cond-mat.soft

    Gravity effects on a bio-inspired self-burrowing probe in granular soils

    Authors: Bowen Wang, Ningning Zhang, Yuyan Chen, Alejandro Martinez, Raul Fuentes

    Abstract: In recent years, self-burrowing probes have been studied since they can be suitable for soil monitoring in locations with limited access such as outer space bodies and underneath existing structures. We study the performance of a self-burrowing probe under different gravity conditions, from low gravity (i.e., 1/6g, 1/3g and 1g) to high gravity (i.e., 5g, 10g and 15g), specifically in terms of pene… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  8. Distribution of number of peaks within a long gamma-ray burst

    Authors: C. Guidorzi, M. Sartori, R. Maccary, A. Tsvetkova, L. Amati, L. Bazzanini, M. Bulla, A. E. Camisasca, L. Ferro, F. Frontera, C. K. Li, S. L. Xiong, S. N. Zhang

    Abstract: The variety of long duration gamma-ray burst (LGRB) light curves (LCs) encode a wealth of information on how LGRB engines release energy following the collapse of the progenitor star. Attempts to characterise GRB LCs focused on a number of properties, such as the minimum variability timescale, power density spectra (both ensemble average and individual), or with different definitions of variabilit… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 7 pages, 5 figures, accepted by A&A

    Journal ref: A&A 685, A34 (2024)

  9. arXiv:2402.16123  [pdf, other

    cs.CL cs.AI cs.CV cs.HC cs.LG

    InstructEdit: Instruction-based Knowledge Editing for Large Language Models

    Authors: Ningyu Zhang, Bozhong Tian, Siyuan Cheng, Xiaozhuan Liang, Yi Hu, Kouying Xue, Yanjie Gou, Xi Chen, Huajun Chen

    Abstract: Knowledge editing for large language models can offer an efficient solution to alter a model's behavior without negatively impacting the overall performance. However, the current approaches encounter issues with limited generalizability across tasks, necessitating one distinct editor for each task, significantly hindering the broader applications. To address this, we take the first step to analyze… ▽ More

    Submitted 28 April, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: IJCAI 2024; the project website is at https://www.zjukg.org/project/InstructEdit/

  10. arXiv:2402.14710  [pdf, other

    cs.CL cs.AI cs.DB cs.IR cs.LG

    IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

    Authors: Honghao Gui, Lin Yuan, Hongbin Ye, Ningyu Zhang, Mengshu Sun, Lei Liang, Huajun Chen

    Abstract: Large Language Models (LLMs) demonstrate remarkable potential across various domains; however, they exhibit a significant performance gap in Information Extraction (IE). Note that high-quality instruction data is the vital key for enhancing the specific capabilities of LLMs, while current IE datasets tend to be small in scale, fragmented, and lack standardized schema. To this end, we introduce IEP… ▽ More

    Submitted 26 May, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (short); 21 pages; Github: https://github.com/zjunlp/IEPile

  11. arXiv:2402.14226  [pdf, other

    astro-ph.HE hep-ph

    Broadband noise and quasi-periodic oscillation characteristics of the X-ray pulsar RX J0440.9+4431

    Authors: P. P. Li, L. Tao, R. C. Ma, M. Y. Ge, Q. C. Zhao, S. J. Zhao, L. Zhang, Q. C. Bu, L. D. Kong, Y. L. Tuo, L. Ji, S. Zhang, J. L. Qu, S. N. Zhang, Y. Huang, X. Ma, W. T. Ye, Q. C. Shui

    Abstract: We present a comprehensive timing analysis on the Be/X-ray binary pulsar RX J0440.9+4431 using observations from \textit{NICER} and \textit{Insight}-HXMT during the 2022--2023 outburst. The power density spectrum (PDS) of RX J0440.9+4431 exhibits typical aperiodic variability in X-ray flux across a wide frequency range. During a super-critical accretion state, we detect quasi-periodic oscillations… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures. Accepted in MNRAS

  12. arXiv:2402.12802  [pdf, ps, other

    math.DG

    The Minkowski problem for the non-compact convex set with an asymptotic boundary condition

    Authors: Ning Zhang

    Abstract: In this paper, combining the covolume, we study the Minkowski theory for the non-compact convex set with an asymptotic boundary condition. In particular, the mixed covolume of two non-compact convex sets is introduced and its geometric interpretation is obtained by the Hadamard variational formula. The Brunn-Minkowski and Minkowski inequalities for covolume are established, and the equivalence of… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 20 pages

    MSC Class: 52B45; 52A20; 52A39; 53A15

  13. arXiv:2402.09995  [pdf, other

    cs.SE

    iJTyper: An Iterative Type Inference Framework for Java by Integrating Constraint- and Statistically-based Methods

    Authors: Zhixiang Chen, Anji Li, Neng Zhang, Jianguo Chen, Yuan Huang, Zibin Zheng

    Abstract: Inferring the types of API elements in incomplete code snippets (e.g., those on Q&A forums) is a prepositive step required to work with the code snippets. Existing type inference methods can be mainly categorized as constraint-based or statistically-based. The former imposes higher requirements on code syntax and often suffers from low recall due to the syntactic limitation of code snippets. The l… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2402.08303   

    cs.CL cs.AI cs.CE cs.HC cs.LG

    ChatCell: Facilitating Single-Cell Analysis with Natural Language

    Authors: Yin Fang, Kangwei Liu, Ningyu Zhang, Xinle Deng, Penghui Yang, Zhuo Chen, Xiangru Tang, Mark Gerstein, Xiaohui Fan, Huajun Chen

    Abstract: As Large Language Models (LLMs) rapidly evolve, their influence in science is becoming increasingly prominent. The emerging capabilities of LLMs in task generalization and free-form dialogue can significantly advance fields like chemistry and biology. However, the field of single-cell biology, which forms the foundational building blocks of living organisms, still faces several challenges. High kn… ▽ More

    Submitted 19 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: I have decided to temporarily withdraw this draft as I am in the process of making further revisions to improve its content. Code: https://github.com/zjunlp/ChatCell Dataset: https://huggingface.co/datasets/zjunlp/ChatCell-Instructions Demo: https://chat.openai.com/g/g-vUwj222gQ-chatcell

  15. arXiv:2402.05391  [pdf, other

    cs.AI cs.CV cs.IR cs.LG

    Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

    Authors: Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

    Abstract: Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the semantic web community's exploration into multi-modal dimensions unlocking new avenues for innovation. In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Kno… ▽ More

    Submitted 26 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Ongoing work; 41 pages (Main Text), 55 pages (Total), 11 Tables, 13 Figures, 619 citations; Paper list is available at https://github.com/zjukg/KG-MM-Survey

  16. arXiv:2402.04356  [pdf, other

    cs.SD cs.CV eess.AS

    Bidirectional Autoregressive Diffusion Model for Dance Generation

    Authors: Canyu Zhang, Youbao Tang, Ning Zhang, Ruei-Sung Lin, Mei Han, **g Xiao, Song Wang

    Abstract: Dance serves as a powerful medium for expressing human emotions, but the lifelike generation of dance is still a considerable challenge. Recently, diffusion models have showcased remarkable generative abilities across various domains. They hold promise for human motion generation due to their adaptable many-to-many nature. Nonetheless, current diffusion-based motion generation models often create… ▽ More

    Submitted 22 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  17. arXiv:2402.03190  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.MM

    Unified Hallucination Detection for Multimodal Large Language Models

    Authors: Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, **jie Gu, Huajun Chen

    Abstract: Despite significant strides in multimodal tasks, Multimodal Large Language Models (MLLMs) are plagued by the critical issue of hallucination. The reliable detection of such hallucinations in MLLMs has, therefore, become a vital aspect of model evaluation and the safeguarding of practical application deployment. Prior research in this domain has been constrained by a narrow focus on singular tasks,… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 (main conference)

  18. arXiv:2402.03049  [pdf, other

    cs.CL cs.AI cs.HC cs.IR cs.LG

    EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

    Authors: Yixin Ou, Ningyu Zhang, Honghao Gui, Ziwen Xu, Shuofei Qiao, Yida Xue, Runnan Fang, Kangwei Liu, Lei Li, Zhen Bi, Guozhou Zheng, Huajun Chen

    Abstract: In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing approaches have been proposed, aiming to achieve a delicate balance between data quantity and data quality. Nevertheless, due to inconsistencies that persist am… ▽ More

    Submitted 23 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ACL 2024 System Demonstrations; Project website: https://zjunlp.github.io/project/EasyInstruct Code: https://github.com/zjunlp/EasyInstruct Video: https://youtu.be/rfQOWYfziFo Demo: https://huggingface.co/spaces/zjunlp/EasyInstruct

  19. arXiv:2402.03005  [pdf, other

    cond-mat.supr-con

    Topological metal and high-order Dirac point in cubic Rashba model

    Authors: Haijiao Ji, Ning Zhang, Noah F. Q. Yuan

    Abstract: We investigate the properties of the two-dimensional model with Rashba-type spin-orbit coupling cubic in electron momentum. In the normal phase, edge states emerge on open boundaries. In the superconducting phase, edge states could evolve into gapped fermionic edge states. Applications to realistic materials of interface superconductors are also discussed.

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 1 table

  20. arXiv:2402.02085  [pdf, other

    cs.CV cs.AI

    DeCoF: Generated Video Detection via Frame Consistency: The First Benchmark Dataset

    Authors: Long Ma, Jiajia Zhang, Hong** Deng, Ningyu Zhang, Qinglang Guo, Haiyang Yu, Yong Liao, Pengyuan Zhou

    Abstract: The escalating quality of video generated by advanced video generation methods results in new security challenges, while there have been few relevant research efforts: 1) There is no open-source dataset for generated video detection, 2) No generated video detection method has been proposed so far. To this end, we propose an open-source dataset and a detection method for generated video for the fir… ▽ More

    Submitted 25 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  21. arXiv:2402.01920  [pdf, other

    cs.LG cs.AI cs.CL

    Preference Poisoning Attacks on Reward Model Learning

    Authors: Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik

    Abstract: Learning utility, or reward, models from pairwise comparisons is a fundamental component in a number of application domains. These approaches inherently entail collecting preference information from people, with feedback often provided anonymously. Since preferences are subjective, there is no gold standard to compare against; yet, reliance of high-impact systems on preference learning creates a s… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  22. arXiv:2401.17623  [pdf, other

    cs.CL

    Neighboring Perturbations of Knowledge Editing on Large Language Models

    Authors: Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang, Jia-Chen Gu

    Abstract: Despite their exceptional capabilities, large language models (LLMs) are prone to generating unintended text due to false or outdated knowledge. Given the resource-intensive nature of retraining LLMs, there has been a notable increase in the development of knowledge editing. However, current approaches and evaluations rarely explore the perturbation of editing on neighboring knowledge. This paper… ▽ More

    Submitted 26 May, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted by ICML 2024

  23. arXiv:2401.17268  [pdf, other

    cs.CL cs.AI cs.LG

    Weaver: Foundation Models for Creative Writing

    Authors: Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, **g Wang , et al. (21 additional authors not shown)

    Abstract: This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  24. arXiv:2401.16292  [pdf, other

    cs.DC

    Pilotfish: Distributed Transaction Execution for Lazy Blockchains

    Authors: Quentin Kniep, Lefteris Kokoris-Kogias, Alberto Sonnino, Igor Zablotchi, Nuda Zhang

    Abstract: Pilotfish is the first scale-out blockchain execution engine able to harness any degree of parallelizability existing in its workload. Pilotfish allows each validator to employ multiple machines, named ExecutionWorkers, under its control to scale its execution layer. Given a sufficiently parallelizable and compute-intensive load, the number of transactions that the validator can execute increases… ▽ More

    Submitted 16 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  25. arXiv:2401.15992  [pdf, other

    astro-ph.HE

    Pulsed Iron line Emission from the First Galactic Ultraluminous X-ray Pulsar Swift J0243.6+6124

    Authors: Y. X. Xiao, Y. J. Xu, M. Y. Ge, F. J. Lu, S. N. Zhang, S. Zhang, L. Tao, J. L. Qu, P. J. Wang, L. D. Kong, Y. L. Tuo, Y. You, S. J. Zhao, J. Q. Peng, Y. F. Du, Y. H. Zhang, W. T. Ye

    Abstract: We report the phase-resolved spectral results of the first Galactic Pulsating Ultra-Luminous X-ray source (PULX) Swift J0243.6+6124, modeling at its 2017-2018 outburst peak using data collected by the Hard X-ray Modulation Telescope (Insight-HXMT). The broad energy coverage of Insight-HXMT allows us to obtain more accurate spectral continuum to reduce the coupling of broad iron line profiles with… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  26. arXiv:2401.15289  [pdf, other

    cs.CR cs.AR

    SoK: Where's the "up"?! A Comprehensive (bottom-up) Study on the Security of Arm Cortex-M Systems

    Authors: Xi Tan, Zheyuan Ma, Sandro Pinto, Le Guan, Ning Zhang, Jun Xu, Zhiqiang Lin, Hongxin Hu, Ziming Zhao

    Abstract: Arm Cortex-M processors are the most widely used 32-bit microcontrollers among embedded and Internet-of-Things devices. Despite the widespread usage, there has been little effort in summarizing their hardware security features, characterizing the limitations and vulnerabilities of their hardware and software stack, and systematizing the research on securing these systems. The goals and contributio… ▽ More

    Submitted 13 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: To Appear in the 18th USENIX WOOT Conference on Offensive Technologies, August 12-13, 2024

    ACM Class: C.0; K.6.5

  27. arXiv:2401.14619  [pdf, other

    cs.LG

    Resilient Practical Test-Time Adaptation: Soft Batch Normalization Alignment and Entropy-driven Memory Bank

    Authors: Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang

    Abstract: Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference. However, the model performance can be significantly impaired by continuous distribution changes in the target domain and non-independent and identically distributed (non-i.i.d.) test samples often encountered in practical scenarios. While existing memory… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  28. arXiv:2401.13910  [pdf

    physics.optics

    Spatiotemporal optical vortices with controllable radial and azimuthal quantum numbers

    Authors: Xin Liu, Qian Cao, Nianjia Zhang, Andy Chong, Yangjian Cai, Qiwen Zhan

    Abstract: Optical spatiotemporal vortices with transverse photon orbital angular momentum (OAM) have recently become a focal point of research. In this work we theoretically and experimentally investigate optical spatiotemporal vortices with radial and azimuthal quantum numbers, known as spatiotemporal Laguerre-Gaussian (STLG) wavepackets. These 3D wavepackets exhibit phase singularities and cylinder-shaped… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  29. arXiv:2401.12642  [pdf

    physics.optics

    Spatiotemporal Hologram

    Authors: Qian Cao, Nianjia Zhang, Andy Chong, Qiwen Zhan

    Abstract: Spatiotemporal structured light has opened up new avenues for optics and photonics. Current spatiotemporal manipulation of light mostly relies on phase-only devices such as liquid crystal spatial light modulator to generate spatiotemporal optical fields with unique photonic properties. However, simultaneous manipulation of both amplitude and phase of the complex field for the spatiotemporal light… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  30. arXiv:2401.11770  [pdf, ps, other

    cond-mat.str-el

    Evidence for Unfolded Fermi Surfaces in the Charge-Density-Wave State of Kagome Metal FeGe Revealed by de Haas-van Alphen Effect

    Authors: Kaixin Tang, Han**g Zhou, Houpu Li, Senyang Pan, Xueliang Wu, Hongyu Li, Nan Zhang, Chuanying Xi, **glei Zhang, Aifeng Wang, Xiangang Wan, Ziji Xiang, Xianhui Chen

    Abstract: The antiferromagnetic kagome lattice compound FeGe has been revealed to host an emergent charge-density-wave (CDW) state which manifests complex interplay between the spin, charge and lattice degrees of freedom. Here, we present a comprehensive study of the de Haas-van Alphen effect by measuring torque magnetometry under magnetic fields up to 45.2 T to map Fermi surfaces in this unusual CDW state.… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 7 pages, 4 figures, to be published in Phys. Rev. Research

  31. arXiv:2401.05268  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.MA

    AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning

    Authors: Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen

    Abstract: Language agents have achieved considerable performance on various complex question-answering tasks by planning with external tools. Despite the incessant exploration in this field, existing language agent systems still struggle with costly, non-reproducible data reliance and face the challenge of compelling a single model for multiple functions. To this end, we introduce AutoAct, an automatic agen… ▽ More

    Submitted 26 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: ACL 2024

  32. arXiv:2401.01286  [pdf, other

    cs.CL cs.AI cs.CV cs.HC cs.LG

    A Comprehensive Study of Knowledge Editing for Large Language Models

    Authors: Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, **tian Zhang, Yuansheng Ni, Siyuan Cheng, Ziwen Xu, Xin Xu, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Lei Liang, Zhiqiang Zhang, Xiaowei Zhu, Jun Zhou, Huajun Chen

    Abstract: Large Language Models (LLMs) have shown extraordinary capabilities in understanding and generating text that closely mirrors human communication. However, a primary limitation lies in the significant computational demands during training, arising from their extensive parameterization. This challenge is further intensified by the dynamic nature of the world, necessitating frequent updates to LLMs t… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Ongoing work; 52 pages, 282 citations; benchmark is available at https://huggingface.co/datasets/zjunlp/KnowEdit code is available at https://github.com/zjunlp/EasyEdit paper list is available at https://github.com/zjunlp/KnowledgeEditingPapers

  33. arXiv:2401.00625  [pdf, ps, other

    cs.LG

    Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models

    Authors: Guangji Bai, Zheng Chai, Chen Ling, Shiyu Wang, Jiaying Lu, Nan Zhang, Tingwei Shi, Ziyang Yu, Mengdan Zhu, Yifei Zhang, Carl Yang, Yue Cheng, Liang Zhao

    Abstract: The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims t… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Preprint. GitHub repo: https://github.com/tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

  34. arXiv:2312.15159  [pdf, other

    cs.LG cs.AI cs.AR cs.CL

    Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference

    Authors: Hongzheng Chen, Jiahao Zhang, Yixiao Du, Shaojie Xiang, Zichao Yue, Niansong Zhang, Yaohui Cai, Zhiru Zhang

    Abstract: Recent advancements in large language models (LLMs) boasting billions of parameters have generated a significant demand for efficient deployment in inference workloads. The majority of existing approaches rely on temporal architectures that reuse hardware units for different network layers and operators. However, these methods often encounter challenges in achieving low latency due to considerable… ▽ More

    Submitted 7 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the FCCM'24 Journal Track and will appear in ACM Transactions on Reconfigurable Technology and Systems (TRETS)

  35. The bright black hole X-ray binary 4U 1543--47 during 2021 outburst: a thick accretion disk inflated by high luminosity

    Authors: S. J. Zhao, L. Tao, P. P. Li, R. Soria, H. Feng, Y. X. Zhang, R. C. Ma, W. D. Zhang, E. L. Qiao, Q. Q. Yin, S. N. Zhang, L. Zhang, Q. C. Bu, X. Ma, Y. Huang, M. Y. Ge, X. B. Li, Q. C. Zhao, J. Q. Peng, Y. X. Xiao

    Abstract: The black hole X-ray binary source 4U 1543--47 experienced a super-Eddington outburst in 2021, reaching a peak flux of up to $\sim1.96\times10^{-7}\rm erg\ \rm cm^{-2}\ \rm s^{-1}$ ($\sim 8.2$ Crab) in the 2--10\,keV band. Soon after the outburst began, it rapidly transitioned into the soft state. Our goal is to understand how the accretion disk structure deviates from a standard thin disk when th… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in Astronomy and Astrophysics. 15 pages, 4 tables, 12 figures

    Journal ref: A&A 685, A42 (2024)

  36. arXiv:2312.08682  [pdf, other

    physics.optics physics.app-ph

    High-coherence parallelization in integrated photonics

    Authors: Xuguang Zhang, Zixuan Zhou, Yijun Guo, Minxue Zhuang, Warren **, Bitao Shen, Yujun Chen, Jiahui Huang, Zihan Tao, Ming **, Ruixuan Chen, Zhangfeng Ge, Zhou Fang, Ning Zhang, Yadong Liu, Pengfei Cai, Weiwei Hu, Haowen Shu, Dong Pan, John E. Bowers, Xingjun Wang, Lin Chang

    Abstract: Coherent optics has profoundly impacted diverse applications ranging from communications, LiDAR to quantum computations. However, building coherent systems in integrated photonics previously came at great expense in hardware integration and energy efficiency: the lack of a power-efficient way to generate highly coherent light necessitates bulky lasers and amplifiers, while frequency and phase reco… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  37. arXiv:2312.06259  [pdf, other

    cs.CV

    Adaptive Annotation Distribution for Weakly Supervised Point Cloud Semantic Segmentation

    Authors: Zhiyi Pan, Nan Zhang, Wei Gao, Shan Liu, Ge Li

    Abstract: Weakly supervised point cloud semantic segmentation has attracted a lot of attention due to its ability to alleviate the heavy reliance on fine-grained annotations of point clouds. However, in practice, sparse annotation usually exhibits a distinct non-uniform distribution in point cloud, which poses challenges for weak supervision. To address these issues, we propose an adaptive annotation distri… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  38. arXiv:2312.05275  [pdf, other

    cs.CR cs.AI

    Exploring the Limits of ChatGPT in Software Security Applications

    Authors: Fangzhou Wu, Qingzhao Zhang, Ati Priya Bajaj, Tiffany Bao, Ning Zhang, Ruoyu "Fish" Wang, Chaowei Xiao

    Abstract: Large language models (LLMs) have undergone rapid evolution and achieved remarkable results in recent times. OpenAI's ChatGPT, backed by GPT-3.5 or GPT-4, has gained instant popularity due to its strong capability across a wide range of tasks, including natural language tasks, coding, mathematics, and engaging conversations. However, the impacts and limits of such LLMs in system security domain ar… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  39. arXiv:2311.18717  [pdf, other

    econ.GN cs.CR cs.MA q-fin.TR stat.AP

    NFT Wash Trading: Direct vs. Indirect Estimation

    Authors: Brett Hemenway Falk, Gerry Tsoukalas, Niuniu Zhang

    Abstract: Recent studies estimate around 70% of traded value on off-chain crypto exchanges like Binance is wash trading. This paper turns to NFT markets, where the on-chain nature of transactions-a key tenet of Web3 innovation-enables more direct estimation methods to be applied. Focusing on three of the largest NFT marketplaces, we find 30-40% of NFT volume and 25-95% of traded value involve wash trading.… ▽ More

    Submitted 5 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  40. arXiv:2311.15717  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Evidence of spin density waves in La$_3$Ni$_2$O$_{7-δ}$

    Authors: Kaiwen Chen, Xiangqi Liu, Jiachen Jiao, Muyuan Zou, Yixuan Luo, Qiong Wu, Ningyuan Zhang, Yanfeng Guo, Lei Shu

    Abstract: The recently discovered superconductivity with critical temperature $T_c$ up to 80 K in the double-layer Nickelate La$_3$Ni$_2$O$_{7-δ}$ under pressure has drawn great attention. Here we report the positive muon spin relaxation ($μ^+$SR) study of polycrystalline La$_3$Ni$_2$O$_{6.92}$ under ambient pressure. Zero-field $μ^+$SR experiments reveal the existence of magnetic order in La$_3$Ni$_2$O… ▽ More

    Submitted 13 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  41. Quantum Simulation of Bound-State-Enhanced Quantum Metrology

    Authors: Cheng-Ge Liu, Cong-Wei Lu, Na-Na Zhang, Qing Ai

    Abstract: Quantum metrology explores quantum effects to improve the measurement accuracy of some physical quantities beyond the classical limit. However, due to the interaction between the system and the environment, the decoherence can significantly reduce the accuracy of the measurement. Many methods have been proposed to restore the accuracy of the measurement in the long-time limit. Recently, it has bee… ▽ More

    Submitted 2 May, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 9 pages,9 figures

    Journal ref: Phys. Rev. A 109, 042623 (2024)

  42. arXiv:2311.13948  [pdf, ps, other

    physics.optics math-ph

    Non-generic bound states in the continuum in waveguides with lateral leakage channels

    Authors: Nan Zhang, Ya Yan Lu

    Abstract: For optical waveguides with a layered background which itself is a slab waveguide, a guided mode is a bound state in the continuum (BIC), if it coexists with slab modes propagating outwards in the lateral direction; i.e., there are lateral leakage channels. It is known that generic BICs in optical waveguides with lateral leakage channels are robust in the sense that they still exist if the wavegui… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  43. arXiv:2311.13162  [pdf, other

    cs.SI cs.DB

    Top-L Most Influential Community Detection Over Social Networks (Technical Report)

    Authors: Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: In many real-world applications such as social network analysis and online marketing/advertising, the community detection is a fundamental task to identify communities (subgraphs) in social networks with high structural cohesiveness. While previous works focus on detecting communities alone, they do not consider the collective influences of users in these communities on other user nodes in social… ▽ More

    Submitted 1 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  44. arXiv:2311.09101  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Towards A Unified View of Answer Calibration for Multi-Step Reasoning

    Authors: Shumin Deng, Ningyu Zhang, Nay Oo, Bryan Hooi

    Abstract: Large Language Models (LLMs) employing Chain-of-Thought (CoT) prompting have broadened the scope for improving multi-step reasoning capabilities. We generally divide multi-step reasoning into two phases: path generation to generate the reasoning path(s); and answer calibration post-processing the reasoning path(s) to obtain a final answer. However, the existing literature lacks systematic analysis… ▽ More

    Submitted 25 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Working in Progress

  45. arXiv:2311.08063  [pdf, ps, other

    quant-ph

    Enhanced mechanical squeezing in an optomechanical system via backward stimulated Brillouin scattering

    Authors: Shan-Shan Chen, Na-Na Zhang, Yong-Rui Guo, Huan Yang, Yong Ma

    Abstract: We investigate theoretically the enhancement of mechanical squeezing in a multimode optomechanical system by introducing a coherent phonon-photon interaction via the backward stimulated Brillouin scattering (BSBS) process. The coherent photon-phonon interaction where two optical modes couple to a Brillouin acoustic mode with a large decay rate provides an extra channel for the cooling of a Duffing… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  46. arXiv:2311.07884  [pdf, other

    cs.CL

    Fair Abstractive Summarization of Diverse Perspectives

    Authors: Yusen Zhang, Nan Zhang, Yixin Liu, Alexander Fabbri, Junru Liu, Ryo Kamoi, Xiaoxin Lu, Caiming Xiong, Jieyu Zhao, Dragomir Radev, Kathleen McKeown, Rui Zhang

    Abstract: People from different social and demographic groups express diverse perspectives and conflicting opinions on a broad set of topics such as product reviews, healthcare, law, and politics. A fair summary should provide a comprehensive coverage of diverse perspectives without underrepresenting certain groups. However, current work in summarization metrics and Large Language Models (LLMs) evaluation h… ▽ More

    Submitted 29 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  47. Atomistic origins of asymmetric charge-discharge kinetics in off-stoichiometric LiNiO$_2$

    Authors: Penghao Xiao, Ning Zhang, Harold Smith Perez, Minjoon Park

    Abstract: LiNiO$_2$ shows poor Li transport kinetics at the ends of charge and discharge in the first cycle, which significantly reduces its available capacity in practice. The atomistic origins of these kinetic limits have not been fully understood. Here, we examine Li transport in LiNiO$_2$ by first-principles-based kinetic Monte Carlo simulations where both long time scale and large length scale are achi… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  48. arXiv:2311.05441  [pdf, other

    hep-th math.AG

    5d SCFTs from Isolated Complete Intersection Singularities

    Authors: Jisheng Mu, Yi-Nan Wang, Hao N. Zhang

    Abstract: In this paper, we explore the zoo of 5d superconformal field theories (SCFTs) constructed from M-theory on Isolated Complete Intersection Singularities (ICIS). We systematically investigate the crepant resolution of such singularities, and obtain a classification of rank $\leqslant 10$ models with a smooth crepant resolution and smooth exceptional divisors, as well as a number of infinite sequence… ▽ More

    Submitted 28 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: v2, 87 pages

  49. arXiv:2311.05132  [pdf, ps, other

    hep-th

    The non-perturbative stringy interaction between NS-brane \& Dp brane

    Authors: J. X. Lu, Nan Zhang

    Abstract: To our best knowledge, the leading non-perturbative stringy interaction between an NS brane and a Dp brane remains unknown. We here present the non-perturbative stringy amplitudes for a system of an F-string and a Dp brane and a system of an NS 5 brane and a Dp brane for $0 \le p \le 6$. In either case, the F or NS5 and the Dp are placed parallel at a separation. We obtain the respective amplitude… ▽ More

    Submitted 26 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 20 pages, 1 table, improved version, two references added

    Report number: USTC-ICTS/PCFT-23-34

  50. arXiv:2311.04487  [pdf, ps, other

    math.CO

    Principal specializations of Schubert polynomials, multi-layered permutations and asymptotics

    Authors: Ningxin Zhang

    Abstract: Let $v(n)$ be the largest principal specialization of Schubert polynomials for layered permutations $v(n) := \max_{w \in \mathcal{L}_n} \mathfrak{S}_w(1,\ldots,1)$. Morales, Pak and Panova proved that there is a limit \[\lim_{n \to \infty} \frac{\log v(n)}{n^2},\] and gave a precise description of layered permutations reaching the maximum. In this paper, we extend Morales Pak and Panova's results… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 16 pages