Skip to main content

Showing 1–6 of 6 results for author: Ye, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.02121  [pdf, ps, other

    cs.MS cs.CV cs.GR math.NA

    Mathematical Supplement for the $\texttt{gsplat}$ Library

    Authors: Vickie Ye, Angjoo Kanazawa

    Abstract: This report provides the mathematical details of the gsplat library, a modular toolbox for efficient differentiable Gaussian splatting, as proposed by Kerbl et al. It provides a self-contained reference for the computations involved in the forward and backward passes of differentiable Gaussian splatting. To facilitate practical usage and development, we provide a user friendly Python API that expo… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Find the library at: https://docs.gsplat.studio/

  2. arXiv:2306.09337  [pdf, other

    cs.CV

    Generative Proxemics: A Prior for 3D Social Interaction from Images

    Authors: Lea Müller, Vickie Ye, Georgios Pavlakos, Michael Black, Angjoo Kanazawa

    Abstract: Social interaction is a fundamental aspect of human behavior and communication. The way individuals position themselves in relation to others, also known as proxemics, conveys social cues and affects the dynamics of social interaction. Reconstructing such interaction from images presents challenges because of mutual occlusion and the limited availability of large training datasets. To address this… ▽ More

    Submitted 12 December, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Project website: muelea.github.io/buddi

  3. arXiv:2302.12827  [pdf, other

    cs.CV

    Decoupling Human and Camera Motion from Videos in the Wild

    Authors: Vickie Ye, Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa

    Abstract: We propose a method to reconstruct global human trajectories from videos in the wild. Our optimization method decouples the camera and human motion, which allows us to place people in the same world coordinate frame. Most existing methods do not model the camera motion; methods that rely on the background pixels to infer 3D human motion usually require a full scene reconstruction, which is often n… ▽ More

    Submitted 20 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Project site: https://vye16.github.io/slahmr. CVPR 2023

  4. arXiv:2204.07151  [pdf, other

    cs.CV

    Deformable Sprites for Unsupervised Video Decomposition

    Authors: Vickie Ye, Zhengqi Li, Richard Tucker, Angjoo Kanazawa, Noah Snavely

    Abstract: We describe a method to extract persistent elements of a dynamic scene from an input video. We represent each scene element as a \emph{Deformable Sprite} consisting of three components: 1) a 2D texture image for the entire video, 2) per-frame masks for the element, and 3) non-rigid deformations that map the texture image into each video frame. The resulting decomposition allows for applications su… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: CVPR 2022 Oral. Project Site: https://deformable-sprites.github.io

  5. arXiv:2012.02190  [pdf, other

    cs.CV cs.GR cs.LG

    pixelNeRF: Neural Radiance Fields from One or Few Images

    Authors: Alex Yu, Vickie Ye, Matthew Tancik, Angjoo Kanazawa

    Abstract: We propose pixelNeRF, a learning framework that predicts a continuous neural scene representation conditioned on one or few input images. The existing approach for constructing neural radiance fields involves optimizing the representation to every scene independently, requiring many calibrated views and significant compute time. We take a step towards resolving these shortcomings by introducing an… ▽ More

    Submitted 30 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: CVPR 2021

  6. arXiv:1907.03680  [pdf, other

    math.OC cs.LG stat.ML

    Robust Guarantees for Perception-Based Control

    Authors: Sarah Dean, Nikolai Matni, Benjamin Recht, Vickie Ye

    Abstract: Motivated by vision-based control of autonomous vehicles, we consider the problem of controlling a known linear dynamical system for which partial state information, such as vehicle position, is extracted from complex and nonlinear data, such as a camera image. Our approach is to use a learned perception map that predicts some linear function of the state and to design a corresponding safe set and… ▽ More

    Submitted 23 December, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: This revision includes reframing the local generalization problem, with relaxed the assumptions so that the robust problem depends on a local slope bound rather than a Lipschitz constant, and provide a method for learning the slope bound from data. We also include additional experiments with a CNN perception module