Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Gao, Shen; Shi, Zhengliang; Zhu, Minghang; Fang, Bowen; Xin, Xin; Ren, Pengjie; Chen, Zhumin; Ma, Jun; Ren, Zhaochun

Computer Science > Artificial Intelligence

arXiv:2308.14034 (cs)

[Submitted on 27 Aug 2023 (v1), last revised 21 Dec 2023 (this version, v2)]

Title:Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Authors:Shen Gao, Zhengliang Shi, Minghang Zhu, Bowen Fang, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

View PDF HTML (experimental)

Abstract:Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extending the capability of LLMs. Although some works employ open-source LLMs for the tool learning task, most of them are trained in a controlled environment in which LLMs only learn to execute the human-provided tools. However, selecting proper tools from the large toolset is also a crucial ability for the tool learning model to be applied in real-world applications. Existing methods usually directly employ self-instruction methods to train the model, which ignores differences in tool complexity. In this paper, we propose the Confucius, a novel tool learning framework to train LLM to use complicated tools in real-world scenarios, which contains two main phases: (1) We first propose a multi-stage learning method to teach the LLM to use various tools from an easy-to-difficult curriculum; (2) thenceforth, we propose the Iterative Self-instruct from Introspective Feedback (ISIF) to dynamically construct the dataset to improve the ability to use the complicated tool. Extensive experiments conducted on both controlled and real-world settings demonstrate the superiority of our tool learning framework in the real-world application scenarios compared to both tuning-free (e.g. ChatGPT, Claude) and tuning-based baselines (e.g. GPT4Tools).

Comments:	Accepted by AAAI 2024
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2308.14034 [cs.AI]
	(or arXiv:2308.14034v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2308.14034

Submission history

From: Shen Gao [view email]
[v1] Sun, 27 Aug 2023 07:53:00 UTC (765 KB)
[v2] Thu, 21 Dec 2023 07:30:31 UTC (747 KB)

Computer Science > Artificial Intelligence

Title:Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators