Search | arXiv e-print repository

FocalPose++: Focal Length and Object Pose Estimation via Render and Compare

Authors: Martin Cífka, Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, Vladimir Petrik, Josef Sivic

Abstract: We introduce FocalPose++, a neural render-and-compare method for jointly estimating the camera-object 6D pose and camera focal length given a single RGB input image depicting a known object. The contributions of this work are threefold. First, we derive a focal length update rule that extends an existing state-of-the-art render-and-compare 6D pose estimator to address the joint estimation task. Se… ▽ More We introduce FocalPose++, a neural render-and-compare method for jointly estimating the camera-object 6D pose and camera focal length given a single RGB input image depicting a known object. The contributions of this work are threefold. First, we derive a focal length update rule that extends an existing state-of-the-art render-and-compare 6D pose estimator to address the joint estimation task. Second, we investigate several different loss functions for jointly estimating the object pose and focal length. We find that a combination of direct focal length regression with a reprojection loss disentangling the contribution of translation, rotation, and focal length leads to improved results. Third, we explore the effect of different synthetic training data on the performance of our method. Specifically, we investigate different distributions used for sampling object's 6D pose and camera's focal length when rendering the synthetic images, and show that parametric distribution fitted on real training data works the best. We show results on three challenging benchmark datasets that depict known 3D models in uncontrolled settings. We demonstrate that our focal length and 6D pose estimates have lower error than the existing state-of-the-art methods. △ Less

Submitted 15 November, 2023; originally announced December 2023.

Comments: 21 pages, 18 figures. arXiv admin note: substantial text overlap with arXiv:2204.05145

arXiv:2311.05344 [pdf, other]

Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking

Authors: Mederic Fourmy, Vojtech Priban, Jan Kristof Behrens, Nicolas Mansard, Josef Sivic, Vladimir Petrik

Abstract: The objective of this work is to enable manipulation tasks with respect to the 6D pose of a dynamically moving object using a camera mounted on a robot. Examples include maintaining a constant relative 6D pose of the robot arm with respect to the object, gras** the dynamically moving object, or co-manipulating the object together with a human. Fast and accurate 6D pose estimation is crucial to a… ▽ More The objective of this work is to enable manipulation tasks with respect to the 6D pose of a dynamically moving object using a camera mounted on a robot. Examples include maintaining a constant relative 6D pose of the robot arm with respect to the object, gras** the dynamically moving object, or co-manipulating the object together with a human. Fast and accurate 6D pose estimation is crucial to achieve smooth and stable robot control in such situations. The contributions of this work are three fold. First, we propose a new visual perception module that asynchronously combines accurate learning-based 6D object pose localizer and a high-rate model-based 6D pose tracker. The outcome is a low-latency accurate and temporally consistent 6D object pose estimation from the input video stream at up to 120 Hz. Second, we develop a visually guided robot arm controller that combines the new visual perception module with a torque-based model predictive control algorithm. Asynchronous combination of the visual and robot proprioception signals at their corresponding frequencies results in stable and robust 6D object pose guided robot arm control. Third, we experimentally validate the proposed approach on a challenging 6D pose estimation benchmark and demonstrate 6D object pose-guided control with dynamically moving objects on a real 7 DoF Franka Emika Panda robot. △ Less

Submitted 9 November, 2023; originally announced November 2023.

arXiv:2209.09012 [pdf, other]

Differentiable Collision Detection: a Randomized Smoothing Approach

Authors: Louis Montaut, Quentin Le Lidec, Antoine Bambade, Vladimir Petrik, Josef Sivic, Justin Carpentier

Abstract: Collision detection appears as a canonical operation in a large range of robotics applications from robot control to simulation, including motion planning and estimation. While the seminal works on the topic date back to the 80s, it is only recently that the question of properly differentiating collision detection has emerged as a central issue, thanks notably to the ongoing and various efforts ma… ▽ More Collision detection appears as a canonical operation in a large range of robotics applications from robot control to simulation, including motion planning and estimation. While the seminal works on the topic date back to the 80s, it is only recently that the question of properly differentiating collision detection has emerged as a central issue, thanks notably to the ongoing and various efforts made by the scientific community around the topic of differentiable physics. Yet, very few solutions have been suggested so far, and only with a strong assumption on the nature of the shapes involved. In this work, we introduce a generic and efficient approach to compute the derivatives of collision detection for any pair of convex shapes, by notably leveraging randomized smoothing techniques which have shown to be particularly adapted to capture the derivatives of non-smooth problems. This approach is implemented in the HPP-FCL and Pinocchio ecosystems, and evaluated on classic datasets and problems of the robotics literature, demonstrating few micro-second timings to compute informative derivatives directly exploitable by many real robotic applications including differentiable simulation. △ Less

Submitted 19 September, 2022; originally announced September 2022.

Comments: 7 pages, 6 figures, 2 tables

arXiv:2208.01960 [pdf, other]

Learning Object Manipulation Skills from Video via Approximate Differentiable Physics

Authors: Vladimir Petrik, Mohammad Nomaan Qureshi, Josef Sivic, Makarand Tapaswi

Abstract: We aim to teach robots to perform simple object manipulation tasks by watching a single video demonstration. Towards this goal, we propose an optimization approach that outputs a coarse and temporally evolving 3D scene to mimic the action demonstrated in the input video. Similar to previous work, a differentiable renderer ensures perceptual fidelity between the 3D scene and the 2D video. Our key n… ▽ More We aim to teach robots to perform simple object manipulation tasks by watching a single video demonstration. Towards this goal, we propose an optimization approach that outputs a coarse and temporally evolving 3D scene to mimic the action demonstrated in the input video. Similar to previous work, a differentiable renderer ensures perceptual fidelity between the 3D scene and the 2D video. Our key novelty lies in the inclusion of a differentiable approach to solve a set of Ordinary Differential Equations (ODEs) that allows us to approximately model laws of physics such as gravity, friction, and hand-object or object-object interactions. This not only enables us to dramatically improve the quality of estimated hand and object states, but also produces physically admissible trajectories that can be directly translated to a robot without the need for costly reinforcement learning. We evaluate our approach on a 3D reconstruction task that consists of 54 video demonstrations sourced from 9 actions such as pull something from right to left or put something in front of something. Our approach improves over previous state-of-the-art by almost 30%, demonstrating superior quality on especially challenging actions involving physical interactions of two objects such as put something onto something. Finally, we showcase the learned skills on a Franka Emika Panda robot. △ Less

Submitted 3 August, 2022; originally announced August 2022.

Comments: Accepted for IROS2022, code at https://github.com/petrikvladimir/video_skills_learning_with_approx_physics, project page at https://data.ciirc.cvut.cz/public/projects/2022Real2SimPhysics/

arXiv:2205.09663 [pdf, other]

Collision Detection Accelerated: An Optimization Perspective

Authors: Louis Montaut, Quentin Le Lidec, Vladimir Petrik, Josef Sivic, Justin Carpentier

Abstract: Collision detection between two convex shapes is an essential feature of any physics engine or robot motion planner. It has often been tackled as a computational geometry problem, with the Gilbert, Johnson and Keerthi (GJK) algorithm being the most common approach today. In this work we leverage the fact that collision detection is fundamentally a convex optimization problem. In particular, we est… ▽ More Collision detection between two convex shapes is an essential feature of any physics engine or robot motion planner. It has often been tackled as a computational geometry problem, with the Gilbert, Johnson and Keerthi (GJK) algorithm being the most common approach today. In this work we leverage the fact that collision detection is fundamentally a convex optimization problem. In particular, we establish that the GJK algorithm is a specific sub-case of the well-established Frank-Wolfe (FW) algorithm in convex optimization. We introduce a new collision detection algorithm by adapting recent works linking Nesterov acceleration and Frank-Wolfe methods. We benchmark the proposed accelerated collision detection method on two datasets composed of strictly convex and non-strictly convex shapes. Our results show that our approach significantly reduces the number of iterations to solve collision detection problems compared to the state-of-the-art GJK algorithm, leading to up to two times faster computation times. △ Less

Submitted 20 May, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

Comments: RSS 2022, 12 pages, 9 figures, 2 tables

Journal ref: Robotics: Science and Systems 2022

arXiv:2111.03088 [pdf, other]

Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

Authors: Kateryna Zorina, Justin Carpentier, Josef Sivic, Vladimír Petrík

Abstract: A seamless integration of robots into human environments requires robots to learn how to use existing human tools. Current approaches for learning tool manipulation skills mostly rely on expert demonstrations provided in the target robot environment, for example, by manually guiding the robot manipulator or by teleoperation. In this work, we introduce an automated approach that replaces an expert… ▽ More A seamless integration of robots into human environments requires robots to learn how to use existing human tools. Current approaches for learning tool manipulation skills mostly rely on expert demonstrations provided in the target robot environment, for example, by manually guiding the robot manipulator or by teleoperation. In this work, we introduce an automated approach that replaces an expert demonstration with a Youtube video for learning a tool manipulation strategy. The main contributions are twofold. First, we design an alignment procedure that aligns the simulated environment with the real-world scene observed in the video. This is formulated as an optimization problem that finds a spatial alignment of the tool trajectory to maximize the sparse goal reward given by the environment. Second, we describe an imitation learning approach that focuses on the trajectory of the tool rather than the motion of the human. For this we combine reinforcement learning with an optimization procedure to find a control policy and the placement of the robot based on the tool motion in the aligned environment. We demonstrate the proposed approach on spade, scythe and hammer tools in simulation, and show the effectiveness of the trained policy for the spade on a real Franka Emika Panda robot demonstration. △ Less

Submitted 4 November, 2021; originally announced November 2021.

Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

arXiv:2011.06813 [pdf, other]

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Authors: Vladimír Petrík, Makarand Tapaswi, Ivan Laptev, Josef Sivic

Abstract: Humans are adept at learning new tasks by watching a few instructional videos. On the other hand, robots that learn new actions either require a lot of effort through trial and error, or use expert demonstrations that are challenging to obtain. In this paper, we explore a method that facilitates learning object manipulation skills directly from videos. Leveraging recent advances in 2D visual recog… ▽ More Humans are adept at learning new tasks by watching a few instructional videos. On the other hand, robots that learn new actions either require a lot of effort through trial and error, or use expert demonstrations that are challenging to obtain. In this paper, we explore a method that facilitates learning object manipulation skills directly from videos. Leveraging recent advances in 2D visual recognition and differentiable rendering, we develop an optimization based method to estimate a coarse 3D state representation for the hand and the manipulated object(s) without requiring any supervision. We use these trajectories as dense rewards for an agent that learns to mimic them through reinforcement learning. We evaluate our method on simple single- and two-object actions from the Something-Something dataset. Our approach allows an agent to learn actions from single videos, while watching multiple demonstrations makes the policy more robust. We show that policies learned in a simulated environment can be easily transferred to a real robot. △ Less

Submitted 13 November, 2020; originally announced November 2020.

Comments: CoRL 2020, code at https://github.com/makarandtapaswi/Real2Sim_CoRL2020, project page at https://data.ciirc.cvut.cz/public/projects/2020Real2Sim/

arXiv:1904.01298 [pdf, other]

Feedback-based Fabric Strip Folding

Authors: Vladimír Petrík, Ville Kyrki

Abstract: Accurate manipulation of a deformable body such as a piece of fabric is difficult because of its many degrees of freedom and unobservable properties affecting its dynamics. To alleviate these challenges, we propose the application of feedback-based control to robotic fabric strip folding. The feedback is computed from the low dimensional state extracted from a camera image. We trained the controll… ▽ More Accurate manipulation of a deformable body such as a piece of fabric is difficult because of its many degrees of freedom and unobservable properties affecting its dynamics. To alleviate these challenges, we propose the application of feedback-based control to robotic fabric strip folding. The feedback is computed from the low dimensional state extracted from a camera image. We trained the controller using reinforcement learning in simulation which was calibrated to cover the real fabric strip behaviors. The proposed feedback-based folding was experimentally compared to two state-of-the-art folding methods and our method outperformed both of them in terms of accuracy. △ Less

Submitted 2 April, 2019; originally announced April 2019.

Comments: Submitted to IEEE/RSJ IROS2019

arXiv:1902.11021 [pdf, other]

Static Stability of Robotic Fabric Strip Folding

Authors: Vladimír Petrík, Vladimír Smutný, Ville Kyrki

Abstract: Planning accurate manipulation for deformable objects requires prediction of their state. The prediction is often complicated by a loss of stability that may result in collapse of the deformable object. In this work, stability of a fabric strip folding performed by a robot is studied. We show that there is a static instability in the folding process. This instability is detected in a physics-based… ▽ More Planning accurate manipulation for deformable objects requires prediction of their state. The prediction is often complicated by a loss of stability that may result in collapse of the deformable object. In this work, stability of a fabric strip folding performed by a robot is studied. We show that there is a static instability in the folding process. This instability is detected in a physics-based simulation and the position of the instability is verified experimentally by real robotic manipulation. Three state-of-the-art methods for folding are assessed in the presence of static instability. It is shown that one of the existing folding paths is suitable for folding of materials with internal friction such as fabrics. Another folding path that utilizes dynamic motion exists for ideal elastic materials without internal friction. Our results show that instability needs to be considered in planning to obtain accurate manipulation of deformable objects. △ Less

Submitted 28 February, 2019; originally announced February 2019.

Comments: In review for IEEE/ASME Transactions On Mechatronics

arXiv:1708.00211 [pdf]

Ab initio based analysis of grain boundary segregation in Al-Mg and Al-Zn binary alloys

Authors: M. V. Petrik, A. R. Kuznetsov, N. Enikeev, Yu. N. Gornostyrev, R. Z. Valiev

Abstract: Based on ab-initio simulations, we report on the nature of principally different mechanisms for interaction of Mg and Zn atoms with grain boundaries in Al alloys leading to different morphology of segregation. The Mg atoms segregate in relatively wide GB region with heterogeneous agglomerations due to the deformation mechanism of solute-GB interaction. In contrast, in the case of Zn atoms an elect… ▽ More Based on ab-initio simulations, we report on the nature of principally different mechanisms for interaction of Mg and Zn atoms with grain boundaries in Al alloys leading to different morphology of segregation. The Mg atoms segregate in relatively wide GB region with heterogeneous agglomerations due to the deformation mechanism of solute-GB interaction. In contrast, in the case of Zn atoms an electronic mechanism associated with the formation of directional bonding is dominating in the solute-GB interaction. As a result, for Zn atoms it is energetically beneficial to occupy interstitial positions at the very GB and to be arranged into thin layers along the GBs. The results obtained show the essential role of elements chemistry in segregation formation and explain the qualitative features in morphology of GB segregation observed in Al-Mg and Al-Zn alloys with ultrafine grains. △ Less

Submitted 1 August, 2017; originally announced August 2017.

Comments: 5 pages, 3 figures, APL

arXiv:1408.3275 [pdf]

doi 10.1103/PhysRevB.90.094101

Role of magnetic degrees of freedom in a scenario of phase transformations in steel

Authors: I. K. Razumov, D. V. Boukhvalov, M. V. Petrik, V. N. Urtsev, A. V. Shmakov, M. I. Katsnelson, Yu. N. Gornostyrev

Abstract: The diversity of mesostructures formed in steel at cooling from a high-temperature austenite ("gamma") phase is determined by the interplay of shear reconstructions of crystal lattice and diffusion of carbon. Combining first-principles calculations with large-scale phase-field simulations we demonstrate a decisive role of magnetic degrees of freedom in the formation of energy relief along the Bain… ▽ More The diversity of mesostructures formed in steel at cooling from a high-temperature austenite ("gamma") phase is determined by the interplay of shear reconstructions of crystal lattice and diffusion of carbon. Combining first-principles calculations with large-scale phase-field simulations we demonstrate a decisive role of magnetic degrees of freedom in the formation of energy relief along the Bain path of "gamma"-"alpha" transformation and, thus, in this interplay. We show that there is the main factor, namely, the magnetic state of iron and its evolution with temperature which controls the change in character of the transformation. Based on the computational results we propose a simple model which reproduces, in good agreement with experiment, the most important curves of the phase transformation in Fe-C, namely, the lines relevant to a start of ferrite, bainite, and martensite transformations. Phase-field simulations within the model describe qualitatively typical patterns at these transformations. △ Less

Submitted 4 September, 2014; v1 submitted 14 August, 2014; originally announced August 2014.

Journal ref: Phys. Rev. B 90, 094101 (2014)

Showing 1–11 of 11 results for author: Petrik, V