Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories

Yang, Pengzhi; Koga, Shumon; Asgharivaskasi, Arash; Atanasov, Nikolay

Computer Science > Robotics

arXiv:2212.01498 (cs)

[Submitted on 3 Dec 2022 (v1), last revised 16 May 2023 (this version, v2)]

Title:Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories

Authors:Pengzhi Yang, Shumon Koga, Arash Asgharivaskasi, Nikolay Atanasov

View PDF

Abstract:This paper proposes a novel model-based policy gradient algorithm for tracking dynamic targets using a mobile robot, equipped with an onboard sensor with limited field of view. The task is to obtain a continuous control policy for the mobile robot to collect sensor measurements that reduce uncertainty in the target states, measured by the target distribution entropy. We design a neural network control policy with the robot $SE(3)$ pose and the mean vector and information matrix of the joint target distribution as inputs and attention layers to handle variable numbers of targets. We also derive the gradient of the target entropy with respect to the network parameters explicitly, allowing efficient model-based policy gradient optimization.

Comments:	12 pages, 2 figures, submitted to Learning for Dynamics and Control Conference
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2212.01498 [cs.RO]
	(or arXiv:2212.01498v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2212.01498

Submission history

From: Shumon Koga [view email]
[v1] Sat, 3 Dec 2022 01:10:44 UTC (494 KB)
[v2] Tue, 16 May 2023 18:25:19 UTC (507 KB)

Computer Science > Robotics

Title:Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators