Skip to main content

Showing 1–10 of 10 results for author: Prokudin, S

.
  1. arXiv:2406.03625  [pdf, other

    cs.CV cs.AI

    Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories

    Authors: Yan Zhang, Sergey Prokudin, Marko Mihajlovic, Qianli Ma, Siyu Tang

    Abstract: Understanding the dynamics of generic 3D scenes is fundamentally challenging in computer vision, essential in enhancing applications related to scene reconstruction, motion tracking, and avatar creation. In this work, we address the task as the problem of inferring dense, long-range motion of 3D points. By observing a set of point trajectories, we aim to learn an implicit motion field parameterize… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: cvpr24 post camera ready

  2. arXiv:2403.16736  [pdf, other

    cs.CV

    Creating a Digital Twin of Spinal Surgery: A Proof of Concept

    Authors: Jonas Hein, Frédéric Giraud, Lilian Calvet, Alexander Schwarz, Nicola Alessandro Cavalcanti, Sergey Prokudin, Mazda Farshad, Siyu Tang, Marc Pollefeys, Fabio Carrillo, Philipp Fürnstahl

    Abstract: Surgery digitalization is the process of creating a virtual replica of real-world surgery, also referred to as a surgical digital twin (SDT). It has significant applications in various fields such as education and training, surgical planning, and automation of surgical tasks. In addition, SDTs are an ideal foundation for machine learning methods, enabling the automatic generation of training data.… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted for the DCA in MI Workshop @ CVPR 2024. Project page: https://jonashein.github.io/surgerydigitization/

  3. arXiv:2401.04728  [pdf, other

    cs.CV cs.AI

    Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

    Authors: Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang

    Abstract: Recent advances in generative diffusion models have enabled the previously unfeasible capability of generating 3D assets from a single input image or a text prompt. In this work, we aim to enhance the quality and functionality of these models for the task of creating controllable, photorealistic human avatars. We achieve this by integrating a 3D morphable model into the state-of-the-art multi-view… ▽ More

    Submitted 2 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: [CVPR 2024] Project page: https://xiyichen.github.io/morphablediffusion/

  4. arXiv:2309.03160  [pdf, other

    cs.CV

    ResFields: Residual Neural Fields for Spatiotemporal Signals

    Authors: Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang

    Abstract: Neural fields, a category of neural networks trained to represent high-frequency signals, have gained significant attention in recent years due to their impressive performance in modeling complex 3D data, such as signed distance (SDFs) or radiance fields (NeRFs), via a single multi-layer perceptron (MLP). However, despite the power and simplicity of representing signals with an MLP, these methods… ▽ More

    Submitted 11 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: [ICLR 2024 Spotlight] Project and code at: https://markomih.github.io/ResFields/

  5. arXiv:2304.02626  [pdf, other

    cs.CV

    Dynamic Point Fields

    Authors: Sergey Prokudin, Qianli Ma, Maxime Raafat, Julien Valentin, Siyu Tang

    Abstract: Recent years have witnessed significant progress in the field of neural surface reconstruction. While the extensive focus was put on volumetric and implicit approaches, a number of works have shown that explicit graphics primitives such as point clouds can significantly reduce computational complexity, without sacrificing the reconstructed surface quality. However, less emphasis has been put on mo… ▽ More

    Submitted 6 April, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  6. arXiv:2212.09530  [pdf, other

    cs.CV

    HARP: Personalized Hand Reconstruction from a Monocular RGB Video

    Authors: Korrawe Karunratanakul, Sergey Prokudin, Otmar Hilliges, Siyu Tang

    Abstract: We present HARP (HAnd Reconstruction and Personalization), a personalized hand avatar creation approach that takes a short monocular RGB video of a human hand as input and reconstructs a faithful hand avatar exhibiting a high-fidelity appearance and geometry. In contrast to the major trend of neural implicit representations, HARP models a hand with a mesh-based parametric hand model, a vertex disp… ▽ More

    Submitted 3 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: CVPR 2023. Project page: https://korrawe.github.io/harp-project/

  7. arXiv:2008.06872  [pdf, other

    cs.CV

    SMPLpix: Neural Avatars from 3D Human Models

    Authors: Sergey Prokudin, Michael J. Black, Javier Romero

    Abstract: Recent advances in deep generative models have led to an unprecedented level of realism for synthetically generated images of humans. However, one of the remaining fundamental limitations of these models is the ability to flexibly control the generative process, e.g.~change the camera and human pose while retaining the subject identity. At the same time, deformable human body models like SMPL and… ▽ More

    Submitted 9 November, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

  8. arXiv:1909.03895  [pdf, other

    cs.LG stat.ML

    Real Time Trajectory Prediction Using Deep Conditional Generative Models

    Authors: Sebastian Gomez-Gonzalez, Sergey Prokudin, Bernhard Scholkopf, Jan Peters

    Abstract: Data driven methods for time series forecasting that quantify uncertainty open new important possibilities for robot tasks with hard real time constraints, allowing the robot system to make decisions that trade off between reaction time and accuracy in the predictions. Despite the recent advances in deep learning, it is still challenging to make long term accurate predictions with the low latency… ▽ More

    Submitted 7 January, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

  9. arXiv:1908.09186  [pdf, other

    cs.CV

    Efficient Learning on Point Clouds with Basis Point Sets

    Authors: Sergey Prokudin, Christoph Lassner, Javier Romero

    Abstract: With the increased availability of 3D scanning technology, point clouds are moving into the focus of computer vision as a rich representation of everyday scenes. However, they are hard to handle for machine learning algorithms due to their unordered structure. One common approach is to apply occupancy grid map**, which dramatically increases the amount of data stored and at the same time loses d… ▽ More

    Submitted 24 August, 2019; originally announced August 2019.

    Comments: ICCV 2019

  10. arXiv:1805.03430  [pdf, other

    cs.CV

    Deep Directional Statistics: Pose Estimation with Uncertainty Quantification

    Authors: Sergey Prokudin, Peter Gehler, Sebastian Nowozin

    Abstract: Modern deep learning systems successfully solve many perception tasks such as object pose estimation when the input image is of high quality. However, in challenging imaging conditions such as on low-resolution images or when the image is corrupted by imaging artifacts, current systems degrade considerably in accuracy. While a loss in performance is unavoidable, we would like our models to quantif… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.