PoseFix: Correcting 3D Human Poses with Natural Language

Delmas, Ginger; Weinzaepfel, Philippe; Moreno-Noguer, Francesc; Rogez, Grégory

arXiv:2309.08480 (cs)

[Submitted on 15 Sep 2023 (v1), last revised 17 Jan 2024 (this version, v2)]

Abstract: Automatically producing instructions to modify one's posture could open the door to many applications, such as personalized coaching and in-home physical therapy. Tackling the reverse problem (i.e., refining a 3D pose based on natural language feedback) could help with assisted 3D character animation or robot teaching, for instance. Although a few recent works explore the connections between natural language and 3D human pose, none focuses on describing 3D body pose differences. In this paper, we tackle the problem of correcting 3D human poses with natural language. To this end, we introduce the PoseFix dataset, which consists of several thousand paired 3D poses, each pair accompanied by text feedback describing how the source pose needs to be modified to obtain the target pose. We demonstrate the potential of this dataset on two tasks: (1) text-based pose editing, which aims at generating corrected 3D body poses given a query pose and a text modifier; and (2) correctional text generation, where instructions are generated based on the differences between two body poses.

Comments:	Published in ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.08480 [cs.CV]
	(or arXiv:2309.08480v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.08480

Submission history

From: Ginger Delmas
[v1] Fri, 15 Sep 2023 15:36:50 UTC (5,828 KB)
[v2] Wed, 17 Jan 2024 10:09:14 UTC (5,828 KB)
