Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

Lim, Michael H.; Tomlin, Claire J.; Sunberg, Zachary N.

Computer Science > Machine Learning

arXiv:2012.10140 (cs)

[Submitted on 18 Dec 2020 (v1), last revised 1 Apr 2021 (this version, v3)]

Title:Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

Authors:Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

View PDF

Abstract:This paper introduces Voronoi Progressive Widening (VPW), a generalization of Voronoi optimistic optimization (VOO) and action progressive widening to partially observable Markov decision processes (POMDPs). Tree search algorithms can use VPW to effectively handle continuous or hybrid action spaces by efficiently balancing local and global action searching. This paper proposes two VPW-based algorithms and analyzes them from theoretical and simulation perspectives. Voronoi Optimistic Weighted Sparse Sampling (VOWSS) is a theoretical tool that justifies VPW-based online solvers, and it is the first algorithm with global convergence guarantees for continuous state, action, and observation POMDPs. Voronoi Optimistic Monte Carlo Planning with Observation Weighting (VOMCPOW) is a versatile and efficient algorithm that consistently outperforms state-of-the-art POMDP algorithms in several simulation experiments.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO); Systems and Control (eess.SY)
Cite as:	arXiv:2012.10140 [cs.LG]
	(or arXiv:2012.10140v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.10140

Submission history

From: Michael Lim [view email]
[v1] Fri, 18 Dec 2020 10:05:43 UTC (442 KB)
[v2] Fri, 26 Mar 2021 13:00:12 UTC (491 KB)
[v3] Thu, 1 Apr 2021 09:47:32 UTC (493 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-12

Change to browse by:

cs
cs.AI
cs.RO
cs.SY
eess
eess.SY

References & Citations

DBLP - CS Bibliography

listing | bibtex

Claire J. Tomlin
Zachary N. Sunberg

export BibTeX citation

Computer Science > Machine Learning

Title:Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators