Skip to main content

Showing 1–3 of 3 results for author: Gui, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.16696  [pdf, other

    cs.CL

    Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

    Authors: Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao

    Abstract: Tool-augmented large language models (LLMs) are attracting widespread attention when accessing up-to-date knowledge and alleviating hallucination issues. Nowadays, advanced closed-source LLMs (e.g., ChatGPT) have demonstrated surprising tool-usage capabilities through prompting and in-context learning techniques. To empower the capabilities of open-source LLMs (e.g., LLaMA) in manipulating tools,… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 20 pages, 18 figures

  2. arXiv:2305.10329  [pdf, other

    cs.LG

    G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks

    Authors: Anchun Gui, **qiang Ye, Han Xiao

    Abstract: It has become a popular paradigm to transfer the knowledge of large-scale pre-trained models to various downstream tasks via fine-tuning the entire model parameters. However, with the growth of model scale and the rising number of downstream tasks, this paradigm inevitably meets the challenges in terms of computation consumption and memory footprint issues. Recently, Parameter-Efficient Fine-Tunin… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 19 pages, 10 figures

  3. arXiv:2305.04573  [pdf, other

    cs.CL

    HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation

    Authors: Anchun Gui, Han Xiao

    Abstract: To fully leverage the advantages of large-scale pre-trained language models (PLMs) on downstream tasks, it has become a ubiquitous adaptation paradigm to fine-tune the entire parameters of PLMs. However, this paradigm poses issues of inefficient updating and resource over-consuming for fine-tuning in data-scarce and resource-limited scenarios, because of the large scale of parameters in PLMs. To a… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 15 pages, 11 figures; Accepted in ACL 2023 (long + main)