
Showing 1–12 of 12 results for author: Insafutdinov, E

  1. arXiv:2406.04343

    cs.CV

    Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

    Authors: Stanislaw Szymanowicz, Eldar Insafutdinov, Chuanxia Zheng, Dylan Campbell, João F. Henriques, Christian Rupprecht, Andrea Vedaldi

    Abstract: In this paper, we propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and extend it to a full 3D shape and appearance reconstructor. For efficiency, we base this extension on feed-forward Gaussian Splatting. Specifically…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.robots.ox.ac.uk/~vgg/research/flash3d/

  2. arXiv:2206.06340

    cs.CV

    SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data

    Authors: Eldar Insafutdinov, Dylan Campbell, João F. Henriques, Andrea Vedaldi

    Abstract: We present a method for the accurate 3D reconstruction of partly-symmetric objects. We build on the strengths of recent advances in neural reconstruction and rendering such as Neural Radiance Fields (NeRF). A major shortcoming of such approaches is that they fail to reconstruct any part of the object which is not clearly visible in the training image, which is often the case for in-the-wild images…

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: First two authors contributed equally

  3. arXiv:2109.07945

    cs.CV cs.LG

    Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

    Authors: Robert McCraith, Eldar Insafutdinov, Lukas Neumann, Andrea Vedaldi

    Abstract: We present a system for automatic converting of 2D mask object predictions and raw LiDAR point clouds into full 3D bounding boxes of objects. Because the LiDAR point clouds are partial, directly fitting bounding boxes to the point clouds is meaningless. Instead, we suggest that obtaining good results requires sharing information between all objects in the dataset jointly, over multiple fram…

    Submitted 9 October, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: ICRA 2022 submission

  4. arXiv:1908.07117

    cs.CV cs.GR

    360-Degree Textures of People in Clothing from a Single Image

    Authors: Verica Lazova, Eldar Insafutdinov, Gerard Pons-Moll

    Abstract: In this paper we predict a full 3D avatar of a person from a single image. We infer texture and geometry in the UV-space of the SMPL model using an image-to-image translation method. Given partial texture and segmentation layout maps derived from the input view, our model predicts the complete segmentation map, the complete texture map, and a displacement map. The predicted maps can be applied to…

    Submitted 19 August, 2019; originally announced August 2019.

  5. arXiv:1810.09381

    cs.CV cs.LG

    Unsupervised Learning of Shape and Pose with Differentiable Point Clouds

    Authors: Eldar Insafutdinov, Alexey Dosovitskiy

    Abstract: We address the problem of learning accurate 3D shape and camera pose from a collection of unlabeled category-specific images. We train a convolutional network to predict both the shape and the pose from a single image by minimizing the reprojection error: given several views of an object, the projections of the predicted shapes to the predicted camera poses should match the provided views. To deal…

    Submitted 22 October, 2018; originally announced October 2018.

  6. arXiv:1710.10000

    cs.CV

    PoseTrack: A Benchmark for Human Pose Estimation and Tracking

    Authors: Mykhaylo Andriluka, Umar Iqbal, Eldar Insafutdinov, Leonid Pishchulin, Anton Milan, Juergen Gall, Bernt Schiele

    Abstract: Human poses and motions are important cues for analysis of videos with people and there is strong evidence that representations based on body pose are highly effective for a variety of tasks such as activity recognition, content retrieval and social signal processing. In this work, we aim to further advance the state of the art by establishing "PoseTrack", a new large-scale benchmark for video-bas…

    Submitted 10 April, 2018; v1 submitted 27 October, 2017; originally announced October 2017.

    Comments: www.posetrack.net

  7. arXiv:1701.00142

    cs.CV

    EgoCap: Egocentric Marker-less Motion Capture with Two Fisheye Cameras (Extended Abstract)

    Authors: Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt

    Abstract: Marker-based and marker-less optical skeletal motion-capture methods use an outside-in arrangement of cameras placed around a scene, with viewpoints converging on the center. They often create discomfort by possibly needed marker suits, and their recording volume is severely restricted and often constrained to indoor scenes with controlled backgrounds. We therefore propose a new method for real-ti…

    Submitted 31 December, 2016; originally announced January 2017.

    Comments: Short version of a SIGGRAPH Asia 2016 paper arXiv:1609.07306, presented at EPIC@ECCV16

  8. arXiv:1612.01465

    cs.CV

    ArtTrack: Articulated Multi-person Tracking in the Wild

    Authors: Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele

    Abstract: In this paper we propose an approach for articulated tracking of multiple people in unconstrained videos. Our starting point is a model that resembles existing architectures for single-frame pose estimation but is substantially faster. We achieve this in two ways: (1) by simplifying and sparsifying the body-part relationship graph and leveraging recent methods for faster inference, and (2) by offl…

    Submitted 9 May, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: Accepted to CVPR 2017

  9. arXiv:1611.04399

    cs.CV cs.DM

    Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications

    Authors: Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

    Abstract: We state a combinatorial optimization problem whose feasible solutions define both a decomposition and a node labeling of a given graph. This problem offers a common mathematical abstraction of seemingly unrelated computer vision tasks, including instance-separating semantic segmentation, articulated human body pose estimation and multiple object tracking. Conceptually, the problem we state genera…

    Submitted 21 February, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

  10. arXiv:1609.07306

    cs.CV

    EgoCap: Egocentric Marker-less Motion Capture with Two Fisheye Cameras

    Authors: Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt

    Abstract: Marker-based and marker-less optical skeletal motion-capture methods use an outside-in arrangement of cameras placed around a scene, with viewpoints converging on the center. They often create discomfort by possibly needed marker suits, and their recording volume is severely restricted and often constrained to indoor scenes with controlled backgrounds. Alternative suit-based systems use several in…

    Submitted 23 September, 2016; originally announced September 2016.

    Comments: SIGGRAPH Asia 2016

  11. arXiv:1605.03170

    cs.CV

    DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

    Authors: Eldar Insafutdinov, Leonid Pishchulin, Bjoern Andres, Mykhaylo Andriluka, Bernt Schiele

    Abstract: The goal of this paper is to advance the state-of-the-art of articulated pose estimation in scenes with multiple people. To that end we contribute on three fronts. We propose (1) improved body part detectors that generate effective bottom-up proposals for body parts; (2) novel image-conditioned pairwise terms that allow to assemble the proposals into a variable number of consistent body part confi…

    Submitted 30 November, 2016; v1 submitted 10 May, 2016; originally announced May 2016.

    Comments: ECCV'16. High-res version at https://www.d2.mpi-inf.mpg.de/sites/default/files/insafutdinov16arxiv.pdf

  12. arXiv:1511.06645

    cs.CV

    DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation

    Authors: Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter Gehler, Bernt Schiele

    Abstract: This paper considers the task of articulated human pose estimation of multiple people in real world images. We propose an approach that jointly solves the tasks of detection and pose estimation: it infers the number of persons in a scene, identifies occluded body parts, and disambiguates body parts between people in close proximity of each other. This joint formulation is in contrast to previous s…

    Submitted 26 April, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: Accepted at IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016)