Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

Wang, Xin Eric; Jain, Vihan; Ie, Eugene; Wang, William Yang; Kozareva, Zornitsa; Ravi, Sujith

Computer Science > Artificial Intelligence

arXiv:2003.00443 (cs)

[Submitted on 1 Mar 2020 (v1), last revised 21 Jul 2020 (this version, v5)]

Title:Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

Authors:Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi

View PDF

Abstract:Recent research efforts enable study for natural language grounded navigation in photo-realistic environments, e.g., following natural language instructions or dialog. However, existing methods tend to overfit training data in seen environments and fail to generalize well in previously unseen environments. To close the gap between seen and unseen environments, we aim at learning a generalized navigation model from two novel perspectives: (1) we introduce a multitask navigation model that can be seamlessly trained on both Vision-Language Navigation (VLN) and Navigation from Dialog History (NDH) tasks, which benefits from richer natural language guidance and effectively transfers knowledge across tasks; (2) we propose to learn environment-agnostic representations for the navigation policy that are invariant among the environments seen during training, thus generalizing better on unseen environments. Extensive experiments show that environment-agnostic multitask learning significantly reduces the performance gap between seen and unseen environments, and the navigation agent trained so outperforms baselines on unseen environments by 16% (relative measure on success rate) on VLN and 120% (goal progress) on NDH. Our submission to the CVDN leaderboard establishes a new state-of-the-art for the NDH task on the holdout test set. Code is available at this https URL.

Comments:	ECCV 2020
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2003.00443 [cs.AI]
	(or arXiv:2003.00443v5 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2003.00443

Submission history

From: Xin Eric Wang [view email]
[v1] Sun, 1 Mar 2020 09:06:31 UTC (3,343 KB)
[v2] Mon, 9 Mar 2020 22:06:54 UTC (2,210 KB)
[v3] Thu, 12 Mar 2020 18:20:39 UTC (2,210 KB)
[v4] Fri, 17 Jul 2020 23:54:02 UTC (2,400 KB)
[v5] Tue, 21 Jul 2020 02:54:38 UTC (2,400 KB)

Computer Science > Artificial Intelligence

Title:Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators