Learning Rewards from Linguistic Feedback

Sumers, Theodore R.; Ho, Mark K.; Hawkins, Robert D.; Narasimhan, Karthik; Griffiths, Thomas L.

arXiv:2009.14715 (cs)

[Submitted on 30 Sep 2020 (v1), last revised 3 Jul 2021 (this version, v3)]

Abstract: We explore unconstrained natural language feedback as a learning signal for artificial agents. Humans use rich and varied language to teach, yet most prior work on interactive learning from language assumes a particular form of input (e.g., commands). We propose a general framework that does not make this assumption, using aspect-based sentiment analysis to decompose feedback into sentiment about the features of a Markov decision process. We then perform an analogue of inverse reinforcement learning, regressing the sentiment on the features to infer the teacher's latent reward function. To evaluate our approach, we first collect a corpus of teaching behavior in a cooperative task where both teacher and learner are human. We implement three artificial learners: sentiment-based "literal" and "pragmatic" models, and an inference network trained end-to-end to predict latent rewards. We then repeat our initial experiment and pair these learners with human teachers. All three successfully learn from interactive human feedback. The sentiment models outperform the inference network, with the "pragmatic" model approaching human performance. Our work thus provides insight into the information structure of naturalistic linguistic feedback as well as methods to leverage it for reinforcement learning.
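
The regression step mentioned in the abstract (regressing sentiment on features to infer a latent reward function) can be illustrated with a minimal sketch. This is not the authors' implementation: the feature counts, the sentiment scores, the three-feature setup, and the function name `infer_reward_weights` are all hypothetical, and the aspect-based sentiment analysis that would produce the sentiment scores is assumed to have already run. The sketch only assumes that the reward is linear in the features and uses ordinary least squares from NumPy.

```python
import numpy as np


def infer_reward_weights(features, sentiments):
    """Least-squares fit of sentiment ~ features @ weights.

    Hypothetical helper: treats each feedback sentiment score as a noisy
    observation of a reward that is linear in the episode's features.
    """
    X = np.asarray(features, dtype=float)
    y = np.asarray(sentiments, dtype=float)
    weights, *_ = np.linalg.lstsq(X, y, rcond=None)
    return weights


# Hypothetical toy data: each row counts how often each of three
# features occurred in one episode shown to the teacher.
features = np.array([
    [2, 0, 1],
    [0, 3, 0],
    [1, 1, 2],
    [0, 0, 3],
])

# Assumed ground-truth reward weights, used here only to simulate
# noise-free sentiment scores for the illustration.
true_weights = np.array([1.0, -1.0, 0.5])
sentiments = features @ true_weights

# Recover the latent reward weights from (features, sentiment) pairs.
weights = infer_reward_weights(features, sentiments)
print(weights)
```

With noise-free toy data and a full-rank feature matrix, the fit recovers the assumed weights up to floating-point error; real sentiment scores would be noisy, so the estimate would only approximate the teacher's reward.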

Comments:	9 pages, 4 figures. AAAI '21
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2009.14715 [cs.AI]
	(or arXiv:2009.14715v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2009.14715

Submission history

From: Theodore Sumers
[v1] Wed, 30 Sep 2020 14:51:00 UTC (2,612 KB)
[v2] Wed, 16 Dec 2020 15:54:34 UTC (889 KB)
[v3] Sat, 3 Jul 2021 19:03:12 UTC (1,240 KB)
