Closing the Planning-Learning Loop with Application to Autonomous Driving in a Crowd

Cai, Panpan; Hsu, David

Computer Science > Robotics

arXiv:2101.03834v1 (cs)

[Submitted on 11 Jan 2021 (this version), latest version 9 Aug 2022 (v3)]

Title:Closing the Planning-Learning Loop with Application to Autonomous Driving in a Crowd

Authors:Panpan Cai, David Hsu

View PDF

Abstract:Imagine an autonomous robot vehicle driving in dense, possibly unregulated urban traffic. To contend with an uncertain, interactive environment with many traffic participants, the robot vehicle has to perform long-term planning in order to drive effectively and approach human-level performance. Planning explicitly over a long time horizon, however, incurs prohibitive computational cost and is impractical under real-time constraints. To achieve real-time performance for large-scale planning, this paper introduces Learning from Tree Search for Driving (LeTS-Drive), which integrates planning and learning in a close loop. LeTS-Drive learns a driving policy from a planner based on sparsely-sampled tree search. It then guides online planning using this learned policy for real-time vehicle control. These two steps are repeated to form a close loop so that the planner and the learner inform each other and both improve in synchrony. The entire algorithm evolves on its own in a self-supervised manner, without explicit human efforts on data labeling. We applied LeTS-Drive to autonomous driving in crowded urban environments in simulation. Experimental results clearly show that LeTS-Drive outperforms either planning or learning alone, as well as open-loop integration of planning and learning.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2101.03834 [cs.RO]
	(or arXiv:2101.03834v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2101.03834

Submission history

From: Panpan Cai [view email]
[v1] Mon, 11 Jan 2021 11:59:09 UTC (4,931 KB)
[v2] Fri, 16 Jul 2021 08:28:21 UTC (6,861 KB)
[v3] Tue, 9 Aug 2022 09:46:50 UTC (7,179 KB)

Computer Science > Robotics

Title:Closing the Planning-Learning Loop with Application to Autonomous Driving in a Crowd

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Closing the Planning-Learning Loop with Application to Autonomous Driving in a Crowd

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators