What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

Finlayson, Matthew; Richardson, Kyle; Sabharwal, Ashish; Clark, Peter

arXiv:2204.09148 (cs)

[Submitted on 19 Apr 2022 (v1), last revised 24 May 2022 (this version, v2)]

Abstract: The instruction learning paradigm -- where a model learns to perform new tasks from task descriptions alone -- has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a given string matches a regular expression (viewed as an instruction) to identify properties of tasks, instructions, and instances that make instruction learning challenging. For instance, we find that our model, a fine-tuned T5-based text2text transformer, struggles with large regular languages, suggesting that less precise instructions are challenging for models. Additionally, instruction executions that require tracking longer contexts of prior steps are also more difficult. We use our findings to systematically construct a challenging instruction learning dataset, which we call Hard RegSet. Fine-tuning on Hard RegSet, our large transformer learns to correctly interpret only 65.6% of test instructions (with at least 90% accuracy), and 11%-24% of the instructions in out-of-distribution generalization settings. We propose Hard RegSet as a challenging instruction learning task, and a controlled environment for studying instruction learning.
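
The task and the headline metric described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: it assumes Python's `re` syntax stands in for the paper's regular expressions, and it assumes that "correctly interpreting" an instruction means reaching at least 90% accuracy on that instruction's test strings. The names `matches`, `instruction_accuracy`, `fraction_interpreted`, `toy_set`, and `always_yes` are hypothetical, and the toy data is invented purely for illustration.

```python
import re
from typing import Callable, Dict, List


def matches(regex: str, string: str) -> bool:
    """Gold label: True if the whole string belongs to the regex's language."""
    return re.fullmatch(regex, string) is not None


def instruction_accuracy(
    regex: str, strings: List[str], predict: Callable[[str, str], bool]
) -> float:
    """Fraction of test strings on which the predictor agrees with the gold label."""
    correct = sum(predict(regex, s) == matches(regex, s) for s in strings)
    return correct / len(strings)


def fraction_interpreted(
    test_set: Dict[str, List[str]],
    predict: Callable[[str, str], bool],
    threshold: float = 0.9,  # assumed reading of "at least 90% accuracy"
) -> float:
    """Fraction of instructions (regexes) whose accuracy meets the threshold."""
    solved = sum(
        instruction_accuracy(regex, strings, predict) >= threshold
        for regex, strings in test_set.items()
    )
    return solved / len(test_set)


# Invented toy data: each regex (instruction) is paired with strings to classify.
toy_set = {
    "(ab)*": ["", "ab", "abab", "aba"],
    "a|b": ["a", "b", "ab", ""],
}


def always_yes(regex: str, string: str) -> bool:
    """Hypothetical trivial baseline that accepts every string."""
    return True


if __name__ == "__main__":
    print(fraction_interpreted(toy_set, matches))     # oracle predictor
    print(fraction_interpreted(toy_set, always_yes))  # trivial baseline
```

In this sketch a model would replace `predict`: it receives the regular expression as the instruction together with a candidate string, and returns a yes/no decision.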

Comments:	Typos corrected, rewordings
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
MSC classes:	68T50
ACM classes:	I.2.7
Cite as:	arXiv:2204.09148 [cs.CL]
	(or arXiv:2204.09148v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.09148

Submission history

From: Matthew Finlayson
[v1] Tue, 19 Apr 2022 22:11:47 UTC (442 KB)
[v2] Tue, 24 May 2022 23:08:27 UTC (443 KB)
