-
FocalPose++: Focal Length and Object Pose Estimation via Render and Compare
Authors:
Martin Cífka,
Georgy Ponimatkin,
Yann Labbé,
Bryan Russell,
Mathieu Aubry,
Vladimir Petrik,
Josef Sivic
Abstract:
We introduce FocalPose++, a neural render-and-compare method for jointly estimating the camera-object 6D pose and camera focal length given a single RGB input image depicting a known object. The contributions of this work are threefold. First, we derive a focal length update rule that extends an existing state-of-the-art render-and-compare 6D pose estimator to address the joint estimation task. Second, we investigate several different loss functions for jointly estimating the object pose and focal length. We find that a combination of direct focal length regression with a reprojection loss disentangling the contribution of translation, rotation, and focal length leads to improved results. Third, we explore the effect of different synthetic training data on the performance of our method. Specifically, we investigate different distributions used for sampling the object's 6D pose and the camera's focal length when rendering the synthetic images, and show that a parametric distribution fitted on real training data works best. We show results on three challenging benchmark datasets that depict known 3D models in uncontrolled settings. We demonstrate that our focal length and 6D pose estimates have lower error than the existing state-of-the-art methods.
Submitted 15 November, 2023;
originally announced December 2023.
-
Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking
Authors:
Mederic Fourmy,
Vojtech Priban,
Jan Kristof Behrens,
Nicolas Mansard,
Josef Sivic,
Vladimir Petrik
Abstract:
The objective of this work is to enable manipulation tasks with respect to the 6D pose of a dynamically moving object using a camera mounted on a robot. Examples include maintaining a constant relative 6D pose of the robot arm with respect to the object, grasping the dynamically moving object, or co-manipulating the object together with a human. Fast and accurate 6D pose estimation is crucial to achieve smooth and stable robot control in such situations. The contributions of this work are threefold. First, we propose a new visual perception module that asynchronously combines an accurate learning-based 6D object pose localizer and a high-rate model-based 6D pose tracker. The outcome is low-latency, accurate, and temporally consistent 6D object pose estimation from the input video stream at up to 120 Hz. Second, we develop a visually guided robot arm controller that combines the new visual perception module with a torque-based model predictive control algorithm. Asynchronous combination of the visual and robot proprioception signals at their corresponding frequencies results in stable and robust 6D object pose guided robot arm control. Third, we experimentally validate the proposed approach on a challenging 6D pose estimation benchmark and demonstrate 6D object pose-guided control with dynamically moving objects on a real 7 DoF Franka Emika Panda robot.
Submitted 9 November, 2023;
originally announced November 2023.
-
Differentiable Collision Detection: a Randomized Smoothing Approach
Authors:
Louis Montaut,
Quentin Le Lidec,
Antoine Bambade,
Vladimir Petrik,
Josef Sivic,
Justin Carpentier
Abstract:
Collision detection appears as a canonical operation in a large range of robotics applications from robot control to simulation, including motion planning and estimation. While the seminal works on the topic date back to the 80s, it is only recently that the question of properly differentiating collision detection has emerged as a central issue, thanks notably to the ongoing and various efforts made by the scientific community around the topic of differentiable physics. Yet, very few solutions have been suggested so far, and only with a strong assumption on the nature of the shapes involved. In this work, we introduce a generic and efficient approach to compute the derivatives of collision detection for any pair of convex shapes, notably by leveraging randomized smoothing techniques, which have been shown to be particularly well suited to capturing the derivatives of non-smooth problems. This approach is implemented in the HPP-FCL and Pinocchio ecosystems, and evaluated on classic datasets and problems of the robotics literature, demonstrating timings of a few microseconds to compute informative derivatives directly exploitable by many real robotic applications including differentiable simulation.
Submitted 19 September, 2022;
originally announced September 2022.
-
Learning Object Manipulation Skills from Video via Approximate Differentiable Physics
Authors:
Vladimir Petrik,
Mohammad Nomaan Qureshi,
Josef Sivic,
Makarand Tapaswi
Abstract:
We aim to teach robots to perform simple object manipulation tasks by watching a single video demonstration. Towards this goal, we propose an optimization approach that outputs a coarse and temporally evolving 3D scene to mimic the action demonstrated in the input video. Similar to previous work, a differentiable renderer ensures perceptual fidelity between the 3D scene and the 2D video. Our key novelty lies in the inclusion of a differentiable approach to solve a set of Ordinary Differential Equations (ODEs) that allows us to approximately model laws of physics such as gravity, friction, and hand-object or object-object interactions. This not only enables us to dramatically improve the quality of estimated hand and object states, but also produces physically admissible trajectories that can be directly translated to a robot without the need for costly reinforcement learning. We evaluate our approach on a 3D reconstruction task that consists of 54 video demonstrations sourced from 9 actions such as pull something from right to left or put something in front of something. Our approach improves over previous state-of-the-art by almost 30%, demonstrating superior quality on especially challenging actions involving physical interactions of two objects such as put something onto something. Finally, we showcase the learned skills on a Franka Emika Panda robot.
Submitted 3 August, 2022;
originally announced August 2022.
-
Collision Detection Accelerated: An Optimization Perspective
Authors:
Louis Montaut,
Quentin Le Lidec,
Vladimir Petrik,
Josef Sivic,
Justin Carpentier
Abstract:
Collision detection between two convex shapes is an essential feature of any physics engine or robot motion planner. It has often been tackled as a computational geometry problem, with the Gilbert, Johnson and Keerthi (GJK) algorithm being the most common approach today. In this work we leverage the fact that collision detection is fundamentally a convex optimization problem. In particular, we establish that the GJK algorithm is a specific sub-case of the well-established Frank-Wolfe (FW) algorithm in convex optimization. We introduce a new collision detection algorithm by adapting recent works linking Nesterov acceleration and Frank-Wolfe methods. We benchmark the proposed accelerated collision detection method on two datasets composed of strictly convex and non-strictly convex shapes. Our results show that our approach significantly reduces the number of iterations to solve collision detection problems compared to the state-of-the-art GJK algorithm, leading to up to two times faster computation times.
Submitted 20 May, 2022; v1 submitted 19 May, 2022;
originally announced May 2022.
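The optimization view described in this abstract can be illustrated with a minimal sketch of plain (non-accelerated) Frank-Wolfe applied to the distance between two convex polygons. This is not the authors' implementation and does not include the Nesterov acceleration the paper introduces; the function names, the 2D polygon representation, and the stopping tolerance are all assumptions made for illustration. The separation distance is found by minimizing 0.5 * ||x||^2 over the Minkowski difference A - B, where the linear minimization step reduces to two support-function queries.

```python
from typing import Sequence, Tuple

Vec = Tuple[float, float]


def dot(a: Vec, b: Vec) -> float:
    return a[0] * b[0] + a[1] * b[1]


def support(vertices: Sequence[Vec], direction: Vec) -> Vec:
    """Vertex of a convex polygon that lies furthest along `direction`."""
    return max(vertices, key=lambda v: dot(v, direction))


def frank_wolfe_distance(shape_a: Sequence[Vec], shape_b: Sequence[Vec],
                         max_iters: int = 200, tol: float = 1e-10) -> float:
    """Distance between two convex polygons via vanilla Frank-Wolfe.

    Hypothetical helper: minimizes 0.5 * ||x||^2 over the Minkowski
    difference A - B. Returns 0.0 (up to tolerance) when the shapes overlap.
    """
    # Start from an arbitrary point of the Minkowski difference.
    x = (shape_a[0][0] - shape_b[0][0], shape_a[0][1] - shape_b[0][1])
    for _ in range(max_iters):
        # The gradient of the objective is x itself. The linear minimization
        # oracle over A - B is support_A(-x) - support_B(x).
        sa = support(shape_a, (-x[0], -x[1]))
        sb = support(shape_b, x)
        s = (sa[0] - sb[0], sa[1] - sb[1])
        d = (s[0] - x[0], s[1] - x[1])
        gap = -dot(x, d)  # Frank-Wolfe duality gap <x, x - s>
        if gap <= tol:
            break
        # Exact line search for a quadratic objective, clipped to [0, 1].
        gamma = min(1.0, gap / dot(d, d))
        x = (x[0] + gamma * d[0], x[1] + gamma * d[1])
    return dot(x, x) ** 0.5


square_a = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
square_b = [(3.0, 0.0), (4.0, 0.0), (4.0, 1.0), (3.0, 1.0)]
distance = frank_wolfe_distance(square_a, square_b)
```

Here the two unit squares are separated by a gap of 2 along the x-axis, so `distance` should come out as 2. The sketch only needs support-function queries on each shape, which is the same primitive GJK relies on.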
-
Learning to Manipulate Tools by Aligning Simulation to Video Demonstration
Authors:
Kateryna Zorina,
Justin Carpentier,
Josef Sivic,
Vladimír Petrík
Abstract:
A seamless integration of robots into human environments requires robots to learn how to use existing human tools. Current approaches for learning tool manipulation skills mostly rely on expert demonstrations provided in the target robot environment, for example, by manually guiding the robot manipulator or by teleoperation. In this work, we introduce an automated approach that replaces an expert demonstration with a YouTube video for learning a tool manipulation strategy. The main contributions are twofold. First, we design an alignment procedure that aligns the simulated environment with the real-world scene observed in the video. This is formulated as an optimization problem that finds a spatial alignment of the tool trajectory to maximize the sparse goal reward given by the environment. Second, we describe an imitation learning approach that focuses on the trajectory of the tool rather than the motion of the human. For this, we combine reinforcement learning with an optimization procedure to find a control policy and the placement of the robot based on the tool motion in the aligned environment. We demonstrate the proposed approach on spade, scythe, and hammer tools in simulation, and show the effectiveness of the trained policy for the spade on a real Franka Emika Panda robot demonstration.
Submitted 4 November, 2021;
originally announced November 2021.
-
Learning Object Manipulation Skills via Approximate State Estimation from Real Videos
Authors:
Vladimír Petrík,
Makarand Tapaswi,
Ivan Laptev,
Josef Sivic
Abstract:
Humans are adept at learning new tasks by watching a few instructional videos. On the other hand, robots that learn new actions either require a lot of effort through trial and error, or use expert demonstrations that are challenging to obtain. In this paper, we explore a method that facilitates learning object manipulation skills directly from videos. Leveraging recent advances in 2D visual recognition and differentiable rendering, we develop an optimization-based method to estimate a coarse 3D state representation for the hand and the manipulated object(s) without requiring any supervision. We use these trajectories as dense rewards for an agent that learns to mimic them through reinforcement learning. We evaluate our method on simple single- and two-object actions from the Something-Something dataset. Our approach allows an agent to learn actions from single videos, while watching multiple demonstrations makes the policy more robust. We show that policies learned in a simulated environment can be easily transferred to a real robot.
Submitted 13 November, 2020;
originally announced November 2020.
-
Feedback-based Fabric Strip Folding
Authors:
Vladimír Petrík,
Ville Kyrki
Abstract:
Accurate manipulation of a deformable body such as a piece of fabric is difficult because of its many degrees of freedom and unobservable properties affecting its dynamics. To alleviate these challenges, we propose the application of feedback-based control to robotic fabric strip folding. The feedback is computed from the low-dimensional state extracted from a camera image. We trained the controller using reinforcement learning in simulation, which was calibrated to cover the real fabric strip behaviors. The proposed feedback-based folding was experimentally compared to two state-of-the-art folding methods, and our method outperformed both of them in terms of accuracy.
Submitted 2 April, 2019;
originally announced April 2019.
-
Static Stability of Robotic Fabric Strip Folding
Authors:
Vladimír Petrík,
Vladimír Smutný,
Ville Kyrki
Abstract:
Planning accurate manipulation for deformable objects requires prediction of their state. The prediction is often complicated by a loss of stability that may result in collapse of the deformable object. In this work, stability of a fabric strip folding performed by a robot is studied. We show that there is a static instability in the folding process. This instability is detected in a physics-based simulation and the position of the instability is verified experimentally by real robotic manipulation. Three state-of-the-art methods for folding are assessed in the presence of static instability. It is shown that one of the existing folding paths is suitable for folding of materials with internal friction such as fabrics. Another folding path that utilizes dynamic motion exists for ideal elastic materials without internal friction. Our results show that instability needs to be considered in planning to obtain accurate manipulation of deformable objects.
Submitted 28 February, 2019;
originally announced February 2019.
-
Ab initio based analysis of grain boundary segregation in Al-Mg and Al-Zn binary alloys
Authors:
M. V. Petrik,
A. R. Kuznetsov,
N. Enikeev,
Yu. N. Gornostyrev,
R. Z. Valiev
Abstract:
Based on ab-initio simulations, we report on the nature of fundamentally different mechanisms for the interaction of Mg and Zn atoms with grain boundaries (GBs) in Al alloys, leading to different segregation morphologies. The Mg atoms segregate in a relatively wide GB region with heterogeneous agglomerations due to the deformation mechanism of solute-GB interaction. In contrast, in the case of Zn atoms, an electronic mechanism associated with the formation of directional bonding dominates the solute-GB interaction. As a result, for Zn atoms it is energetically beneficial to occupy interstitial positions at the very GB and to be arranged into thin layers along the GBs. The results obtained show the essential role of the elements' chemistry in segregation formation and explain the qualitative features in the morphology of GB segregation observed in Al-Mg and Al-Zn alloys with ultrafine grains.
Submitted 1 August, 2017;
originally announced August 2017.
-
Role of magnetic degrees of freedom in a scenario of phase transformations in steel
Authors:
I. K. Razumov,
D. V. Boukhvalov,
M. V. Petrik,
V. N. Urtsev,
A. V. Shmakov,
M. I. Katsnelson,
Yu. N. Gornostyrev
Abstract:
The diversity of mesostructures formed in steel on cooling from a high-temperature austenite ("gamma") phase is determined by the interplay of shear reconstructions of the crystal lattice and diffusion of carbon. Combining first-principles calculations with large-scale phase-field simulations, we demonstrate a decisive role of magnetic degrees of freedom in the formation of the energy relief along the Bain path of the "gamma"-"alpha" transformation and, thus, in this interplay. We show that the main factor controlling the change in character of the transformation is the magnetic state of iron and its evolution with temperature. Based on the computational results, we propose a simple model which reproduces, in good agreement with experiment, the most important curves of the phase transformation in Fe-C, namely, the lines relevant to the start of the ferrite, bainite, and martensite transformations. Phase-field simulations within the model qualitatively describe typical patterns at these transformations.
Submitted 4 September, 2014; v1 submitted 14 August, 2014;
originally announced August 2014.