Skip to main content

Showing 1–8 of 8 results for author: Rockwell, C

.
  1. arXiv:2403.03221  [pdf, other

    cs.CV

    FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation

    Authors: Chris Rockwell, Nilesh Kulkarni, Linyi **, Jeong Joon Park, Justin Johnson, David F. Fouhey

    Abstract: Estimating relative camera poses between images has been a central problem in computer vision. Methods that find correspondences and solve for the fundamental matrix offer high precision in most cases. Conversely, methods predicting pose directly using neural networks are more robust to limited overlap and can infer absolute translation scale, but at the expense of reduced precision. We show how t… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024. Project Page: https://crockwell.github.io/far/

  2. arXiv:2306.07279  [pdf, other

    cs.CV

    Scalable 3D Captioning with Pretrained Models

    Authors: Tiange Luo, Chris Rockwell, Honglak Lee, Justin Johnson

    Abstract: We introduce Cap3D, an automatic approach for generating descriptive text for 3D objects. This approach utilizes pretrained models from image captioning, image-text alignment, and LLM to consolidate captions from multiple views of a 3D asset, completely side-step** the time-consuming and costly process of manual annotation. We apply Cap3D to the recently introduced large-scale 3D dataset, Objave… ▽ More

    Submitted 15 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Dataset link: https://huggingface.co/datasets/tiange/Cap3D

  3. arXiv:2208.08988  [pdf, other

    cs.CV

    The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

    Authors: Chris Rockwell, Justin Johnson, David F. Fouhey

    Abstract: We present a simple baseline for directly estimating the relative pose (rotation and translation, including scale) between two images. Deep methods have recently shown strong progress but often require complex or multi-stage architectures. We show that a handful of modifications can be applied to a Vision Transformer (ViT) to bring its computations close to the Eight-Point Algorithm. This inductiv… ▽ More

    Submitted 23 January, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to 3DV 2022; Project Page: https://crockwell.github.io/rel_pose/ Revision: Fixed Epipolar Lines in Figure 3, Figure 10

  4. arXiv:2208.04307  [pdf, other

    cs.CV

    PlaneFormers: From Sparse View Planes to 3D Reconstruction

    Authors: Samir Agarwala, Linyi **, Chris Rockwell, David F. Fouhey

    Abstract: We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer,… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted to ECCV 2022

  5. arXiv:2206.08355  [pdf, other

    cs.CV

    FWD: Real-time Novel View Synthesis with Forward War** and Depth

    Authors: Ang Cao, Chris Rockwell, Justin Johnson

    Abstract: Novel view synthesis (NVS) is a challenging task requiring systems to generate photorealistic images of scenes from new viewpoints, where both quality and speed are important for applications. Previous image-based rendering (IBR) methods are fast, but have poor quality when input views are sparse. Recent Neural Radiance Fields (NeRF) and generalizable variants give impressive results but are not r… ▽ More

    Submitted 5 August, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: CVPR 2022. Project website https://caoang327.github.io/FWD/

  6. arXiv:2203.16531  [pdf, other

    cs.CV

    Understanding 3D Object Articulation in Internet Videos

    Authors: Shengyi Qian, Linyi **, Chris Rockwell, Siyi Chen, David F. Fouhey

    Abstract: We propose to investigate detecting and characterizing the 3D planar articulation of objects from ordinary videos. While seemingly easy for humans, this problem poses many challenges for computers. We propose to approach this problem by combining a top-down detection system that finds planes that can be articulated along with an optimization approach that solves for a 3D plane that can explain a s… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  7. arXiv:2108.05892  [pdf, other

    cs.CV

    PixelSynth: Generating a 3D-Consistent Experience from a Single Image

    Authors: Chris Rockwell, David F. Fouhey, Justin Johnson

    Abstract: Recent advancements in differentiable rendering and 3D reasoning have driven exciting results in novel view synthesis from a single image. Despite realistic results, methods are limited to relatively small view change. In order to synthesize immersive scenes, models must also be able to extrapolate. We present an approach that fuses 3D reasoning with autoregressive modeling to outpaint large view… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: In ICCV 2021

  8. arXiv:2008.06046  [pdf, other

    cs.CV

    Full-Body Awareness from Partial Observations

    Authors: Chris Rockwell, David F. Fouhey

    Abstract: There has been great progress in human 3D mesh recovery and great interest in learning about the world from consumer video data. Unfortunately current methods for 3D human mesh recovery work rather poorly on consumer video data, since on the Internet, unusual camera viewpoints and aggressive truncations are the norm rather than a rarity. We study this problem and make a number of contributions to… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: In ECCV 2020