Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

Mason, Justice; Allen-Blanchette, Christine; Zolman, Nicholas; Davison, Elizabeth; Leonard, Naomi Ehrich

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.14666 (cs)

[Submitted on 24 Aug 2023 (v1), last revised 10 Apr 2024 (this version, v2)]

Title:Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

Authors:Justice Mason, Christine Allen-Blanchette, Nicholas Zolman, Elizabeth Davison, Naomi Ehrich Leonard

View PDF HTML (experimental)

Abstract:In many real-world settings, image observations of freely rotating 3D rigid bodies may be available when low-dimensional measurements are not. However, the high-dimensionality of image data precludes the use of classical estimation techniques to learn the dynamics. The usefulness of standard deep learning methods is also limited, because an image of a rigid body reveals nothing about the distribution of mass inside the body, which, together with initial angular velocity, is what determines how the body will rotate. We present a physics-based neural network model to estimate and predict 3D rotational dynamics from image sequences. We achieve this using a multi-stage prediction pipeline that maps individual images to a latent representation homeomorphic to $\mathbf{SO}(3)$, computes angular velocities from latent pairs, and predicts future latent states using the Hamiltonian equations of motion. We demonstrate the efficacy of our approach on new rotating rigid-body datasets of sequences of synthetic images of rotating objects, including cubes, prisms and satellites, with unknown uniform and non-uniform mass distributions. Our model outperforms competing baselines on our datasets, producing better qualitative predictions and reducing the error observed for the state-of-the-art Hamiltonian Generative Network by a factor of 2.

Comments:	Previously appeared as arXiv:2209.11355v2, which was submitted as a replacement by accident. arXiv admin note: text overlap with arXiv:2209.11355
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
Cite as:	arXiv:2308.14666 [cs.CV]
	(or arXiv:2308.14666v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.14666

Submission history

From: Justice Mason [view email]
[v1] Thu, 24 Aug 2023 17:47:32 UTC (13,229 KB)
[v2] Wed, 10 Apr 2024 23:39:38 UTC (21,030 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators