Learning Task Decomposition with Ordered Memory Policy Network

Lu, Yuchen; Shen, Yikang; Zhou, Siyuan; Courville, Aaron; Tenenbaum, Joshua B.; Gan, Chuang


arXiv:2103.10972 (cs)

[Submitted on 19 Mar 2021]


Abstract: Many complex real-world tasks are composed of several levels of sub-tasks. Humans leverage these hierarchical structures to accelerate the learning process and achieve better generalization. In this work, we study the inductive bias and propose Ordered Memory Policy Network (OMPN) to discover subtask hierarchy by learning from demonstration. The discovered subtask hierarchy could be used to perform task decomposition, recovering the subtask boundaries in an unstructured demonstration. Experiments on Craft and Dial demonstrate that our model can achieve higher task decomposition performance than strong baselines under both unsupervised and weakly supervised settings. OMPN can also be directly applied to partially observable environments and still achieve higher task decomposition performance. Our visualization further confirms that the subtask hierarchy can emerge in our model.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2103.10972 [cs.LG]
	(or arXiv:2103.10972v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.10972

Submission history

From: Yuchen Lu
[v1] Fri, 19 Mar 2021 18:13:35 UTC (13,985 KB)
